Modified fc proteins comprising site-specific non-natural amino acid residues, conjugates of the same, methods of their preparation and methods of their use

ABSTRACT

Provided herein are modified Fc proteins comprising non-natural amino acid residues at site-specific positions, conjugates of the modified Fc proteins for therapy or diagnosis, compositions comprising the modified Fc proteins and conjugates thereof, methods of their production and methods of their use. The modified Fc proteins and conjugates are useful for methods of treatment and prevention, methods of detection and methods of diagnosis.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 13/928,182, filed Jun. 26, 2013, and now issued as U.S. Pat. No. 9,732,161, which in turn claims the benefit under 35 U.S.C. §119(e) of U.S. Provisional Application No. 61/664,686, filed Jun. 26, 2012, and U.S. Provisional Application No. 61/725,439, filed Nov. 12, 2012. Each of the foregoing applications is incorporated herein by reference in its entirety.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 23, 2017, is named Sequence_Listing.txt and is 17,423 bytes in size.

FIELD

Provided herein are modified Fc proteins comprising non-natural amino acid residues at site-specific positions, compositions comprising the modified Fc proteins and conjugates thereof, methods of their production and methods of their use.

BACKGROUND

Antibodies or immunoglobulins comprise two functionally independent parts, a variable domain known as “Fab,” which binds antigen, and a constant domain known as “Fc,” which links to such effector functions as complement activation and attack by phagocytic cells. An immunoglobulin Fc domain has a long serum half-life, whereas a Fab domain is short-lived. See, for example, Capon et al., 1989, Nature 337: 525-531, which is hereby incorporated by reference herein in its entirety.

Immunoglobulin Fc domains and fragments thereof have found widespread use as carrier or conjugate proteins for a variety of therapeutic and diagnostic molecules. When constructed together with, for example, a therapeutic protein or peptide, an immunoglobulin Fc domain can provide longer half-life, or can incorporate such functions as Fc receptor binding, protein A binding, complement fixation and perhaps even placental transfer. As a carrier or conjugate, an Fc domain or fragment can be superior to other conjugates, e.g., albumin and PEG: an Fc domain or fragment provides more stability, longer half-life, and reduced immunogenicity to the molecules attached thereto. For example, attachment of a drug to an Fc domain can increase the serum half-life of the drug and reduce the risk of inducing immune responses.

Various methods have been used to attach therapeutic and/or diagnostic molecules to an Fc domain or fragment. For example, conventional approaches for chemical conjugation to the immunoglobulin Fc domain include random coupling to naturally occurring primary amines such as lysine and the amino-terminus or carboxylic acids such as glutamic acid, aspartic acid and the carboxy terminus. Alternatively, semi-selective site-specific coupling may be achieved through N-terminal conjugation under appropriate conditions, or derivatized carbohydrates as found on Fc proteins isolated from eukaryotic sources, or by partial reduction and coupling of native cysteine residues. (E.g., Kim et al., A pharmaceutical composition comprising an immunoglobulin Fc region as a carrier, WO 2005/047337). While each of these approaches has been applied successfully, they typically suffer from varying degrees of conjugate heterogeneity, relatively low yields and sometimes, significant losses in functional activity.

In addition, modifications have been made to Fc domains and/or fragments to optimize their function as carrier or conjugate proteins. For example, numerous fusions of proteins and peptides have been engineered at either the amino- or carboxy-terminus of an Fc domain and/or fragment thereof. Also, a variety of enzymes and synthetic reporter molecules have been chemically conjugated to the side chains of non-terminal amino acids as well as the derivatized carbohydrate moieties of the Fc domain. Further, polymers such as polyethylene glycol (PEG) have been conjugated to the Fc domain for the purpose of improved half-life in vivo and reduced immunogenicity.

However, there are problems associated with existing Fc-based conjugates, including adverse or less optimal effects on the specificity, efficiency, yield, solubility, and activity of the therapeutic or diagnostic molecules. There is a need for better Fc-based carrier proteins to further improve the properties of the therapeutic or diagnostic molecules conjugated thereto; in particular, to further increase their half-life in serum.

SUMMARY

Provided herein are modified Fc proteins modified at one or more site-specific positions with one or more non-natural amino acid residues. These site-specific positions are optimal for substitution of a natural amino acid residue with a non-natural amino acid residue. In certain embodiments, substitution at these site-specific positions yields Fc proteins that are uniform in substitution, i.e. that are substantially modified in the selected position. In certain embodiments, a modified Fc protein substituted at one or more of these site-specific positions has advantageous production yield, advantageous solubility, advantageous binding and/or advantageous activity. The properties of these modified Fc proteins are described in detail in the sections below.

In one aspect, provided herein are Fc proteins comprising a polypeptide chain having at least one non-natural amino acid residue at a position in the polypeptide chain that is optimally substitutable. The modified Fc protein can be a monomer or dimer. Said dimers can be homodimers or heterodimers. The position in the polypeptide chain that is optimally substitutable is any position in the polypeptide chain that can provide a substitution with optimal yield, uniformity, solubility, binding and/or activity. The sections below describe in detail the optimally substitutable positions of such polypeptide chains. Also described below are useful Fc proteins containing useful non-natural amino acids.

In a further aspect, provide herein are conjugates of the Fc proteins with one or more payload molecules. The payload molecule can be any molecule deemed useful for conjugating to a modified Fc protein. In certain embodiments, the payload molecule can be a therapeutic molecule or a diagnostic molecule. The payload molecule can be linked to the Fc protein directly via a covalent bond or indirectly via a linker. Advantageously, in certain embodiments, the non-natural amino acids of the modified Fc proteins provide sites useful for linking to the linker or to the payload molecule. Accordingly, provided herein are conjugates comprising a modified Fc protein linked to a payload moiety through a non-natural amino acid at an optimally substitutable site of the Fc protein.

In another aspect, provided herein are compositions comprising said modified Fc proteins or conjugates thereof. Advantageously, such compositions can have high uniformity because of the uniformity of the substitution of the modified Fc proteins provided herein. In certain embodiments, the compositions comprise a substantial amount of the modified Fc protein or conjugate thereof when measured by total weight of protein or when measured by total weight of Fc protein or conjugate. In certain embodiments, the compositions comprise at least 80% of the modified Fc protein or conjugate thereof, at least 85% of the Fc protein or conjugate, at least 90% of the modified Fc protein or conjugate thereof, or at least 95% of the Fc protein or conjugate by weight.

In another aspect, provided herein are methods of making the modified Fc proteins. The modified Fc proteins can be made by any technique apparent to those of skill in the art for incorporating non-natural amino acids into site-specific positions of Fc protein chains. In certain embodiments, the modified Fc proteins are made by solid phase synthesis, semi-synthesis, in vivo translation, in vitro translation or cell-free translation.

In another aspect, provided herein are methods of making the conjugates of the modified Fc protein (also referred to as the Fc protein conjugates). The Fc protein conjugates can be made by any technique apparent to those of skill in the art for incorporating non-natural amino acids into site-specific positions of Fc protein chains and for linking the Fc proteins to payload molecules. In certain embodiments, the modified Fc proteins are made by solid phase synthesis, semi-synthesis, in vivo translation, in vitro translation or cell-free translation.

In another aspect, provided herein are methods of using the Fc protein conjugates for therapy. Modified Fc proteins directed to a therapeutic target can incorporate one or more site-specific non-natural amino acids according to the description herein. These Fc protein conjugates can be used for treating or preventing a disease or condition associated with the therapeutic target. Advantageously, a site-specific non-natural amino acid is used to link the Fc protein to a therapeutic payload to facilitate efficacy. Exemplary Fc protein conjugates, therapeutic targets and diseases or conditions are described herein.

In another aspect, provided herein are methods of using the Fc protein conjugates for detection. Fc protein conjugates can incorporate one or more site-specific non-natural amino acids according to the description herein. These modified Fc proteins can be used with a label to signal binding to the detection target. Advantageously, a site-specific non-natural amino acid can be used to link the modified Fc protein to a label to facilitate detection. Exemplary Fc protein conjugates, detection targets and labels are described herein.

In another aspect, provided herein are methods of modifying the stability of payload molecules. Fc proteins can be modified with a non-natural amino acid as described herein to facilitate linking to a payload molecule thereby modifying the stability of the payload molecule. For instance, a payload molecule can be linked to an Fc protein to increase the in vivo stability of the payload molecule. Exemplary payload molecules and linking moieties are described herein.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 provides the structure of exemplary cytotoxic reagent DBCO-MMAF.

FIG. 2 provides a single peak hydrophobic interaction chromatography (HIC) assay profile.

FIG. 3 provides a not-well-resolved (NWR) HIC assay profile.

FIG. 4 provides a well-resolved (WR) HIC assay profile.

FIGS. 5A and 5B depict exemplary HIC traces of two variants (HC T110 and HC S112), showing peaks corresponding to unconjugated, partially conjugated and fully conjugated IgGs.

FIGS. 6A-6E depict suppression efficiency and soluble yield comparison data of antibodies comprising different unnatural amino acids.

FIGS. 7A and 7B depict suppression efficiency and soluble yield comparison data of antibodies comprising different unnatural amino acids

FIG. 8 depicts cell killing of CD-30 positive cell lines by exemplary brentuximab antibody-drug conjugates.

FIG. 9 provides experimental results demonstrating that exemplary brentuximab antibody-drug conjugates do not kill cells that do not express CD-30.

FIG. 10 depicts binding to CD-30 positive cell lines by exemplary brentuximab (ADCETRIS®) antibody-drug conjugates.

FIGS. 11A, 11B, 11C, and 11D provide the structure of exemplary cytotoxic reagents DBCO-MMAF 2, DBCO-DM4, DBCO-DM4 2, and DBCO-MMAE, respectively.

FIG. 12 provides a graphical representation of the pharmacokinetics of an exemplary scFv-Fc site-specific antibody-drug conjugate.

FIGS. 13A and 13B depict a graphical representation of the in vivo effectiveness of exemplary site-specific antibody-drug conjugates to retard tumor growth and/or to regress tumor size in an animal model.

DESCRIPTION OF EXEMPLARY EMBODIMENTS

Provided herein are modified Fc proteins having non-natural amino acids at one or more site-specific positions, compositions comprising the modified Fc proteins and conjugates thereof, methods of making the modified Fc proteins and conjugates thereof, and methods of their use.

Definitions

When referring to the modified Fc proteins or conjugates thereof provided herein, the terms used have the following meanings unless indicated otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of ordinary skill in the art. In the event that there is a plurality of definitions for a term herein, those in this section prevail unless stated otherwise.

As used herein, the singular forms “a,” “an,” and “the” include the plural reference unless the context clearly indicates otherwise. Thus, for example, reference to an “Fc protein” is a reference to one or more such Fc proteins, etc.

The term “substantially pure” with respect to a composition comprising a modified Fc protein refers to a composition that includes at least 80, 85, 90 or 95% by weight or, in certain embodiments, 95, 98, 99 or 100% by weight, e.g. dry weight, of the modified Fc protein relative to the remaining portion of the composition. The weight percentage can be relative to the total weight of protein in the composition or relative to the total weight of Fc proteins in the composition. Purity can be determined by techniques apparent to those of skill in the art, for instance SDS-PAGE.

The term “isolated” refers to an Fc protein a modified Fc protein or a conjugate thereof that is substantially or essentially free of components that normally accompany or interact with the antibody as found in its naturally occurring environment or in its production environment, or both. Isolated antibody preparations have less than about 30%, less than about 25%, less than about 20%, less than about 15%, less than about 10%, less than about 5%, less than about 4%, less than about 3%, less than about 2% or less than about 1% of contaminating protein by weight, e.g. dry weight.

The term “antibody” refers to any macromolecule that would be recognized as an antibody by those of skill in the art. Antibodies share common properties including binding and at least one polypeptide chain that is substantially identical to a polypeptide chain that can be encoded by any of the immunoglobulin genes recognized by those of skill in the art. The immunoglobulin genes include, but are not limited to, the κ, λ, α, γ (IgG1, IgG2, IgG3, and IgG4), δ, ε and μ constant region genes, as well as the immunoglobulin variable region genes. The term includes full-length antibodies and antibody fragments recognized by those of skill in the art, and variants thereof. The term further includes glycosylated and aglycosylated antibodies.

The term “antibody fragment” refers to any form of an antibody other than the full-length form. Antibody fragments herein include antibodies that are smaller components that exist within full-length antibodies, and antibodies that have been engineered. Antibody fragments include but are not limited to Fv, Fc, Fab, and (Fab′)₂, single chain Fv (scFv), diabodies, triabodies, tetrabodies, bifunctional hybrid antibodies, CDR1, CDR2, CDR3, combinations of CDR's, variable regions, framework regions, constant regions, and the like (Maynard & Georgiou, 2000, Annu. Rev. Biomed. Eng. 2:339-76; Hudson, 1998, Curr. Opin. Biotechnol. 9:395-402).

The term “immunoglobulin (Ig)” refers to a protein consisting of one or more polypeptides substantially encoded by one of the immunoglobulin genes, or a protein substantially identical thereto in amino acid sequence. Immunoglobulins include but are not limited to antibodies. Immunoglobulins may have a number of structural forms, including but not limited to full-length antibodies, antibody fragments, and individual immunoglobulin domains including but not limited to V_(H), Cγ1, Cγ2, Cγ3, V_(L), and C_(L).

The term “immunoglobulin (Ig) domain” refers to a protein domain consisting of a polypeptide substantially encoded by an immunoglobulin gene. Ig domains include but are not limited to V_(H), Cγ1, Cγ2, Cγ3, V_(L), and C_(L).

The term “variable region” of an antibody refers to a polypeptide or polypeptides composed of the V_(H) immunoglobulin domain, the V_(L) immunoglobulin domains, or the V_(H) and V_(L) immunoglobulin domains. Variable region may refer to this or these polypeptides in isolation, as an Fv fragment, as a scFv fragment, as this region in the context of a larger antibody fragment, or as this region in the context of a full-length antibody or an alternative, non-antibody scaffold molecule.

The term “variable” refers to the fact that certain portions of the variable domains differ extensively in sequence among antibodies and are responsible for the binding specificity of each particular antibody for its particular antigen. However, the variability is not evenly distributed through the variable domains of antibodies. It is concentrated in three segments called Complementarity Determining Regions (CDRs) both in the light chain and the heavy chain variable domains. The more highly conserved portions of the variable domains are called the framework regions (FR). The variable domains of native heavy and light chains each comprise four FR regions, largely adopting a β-sheet configuration, connected by three or four CDRs, which form loops connecting, and in some cases forming part of, the β-sheet structure. The CDRs in each chain are held together in close proximity by the FR regions and, with the CDRs from the other chain, contribute to the formation of the antigen binding site of antibodies (see Kabat et al., Sequences of Proteins of Immunological Interest, 5th Ed. Public Health Service, National Institutes of Health, Bethesda, Md. (1991)).

The constant domains are not typically involved directly in binding an antibody to an antigen, but exhibit various effector functions. Depending on the amino acid sequence of the constant region of their heavy chains, antibodies or immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further divided into subclasses (isotypes), e.g. IgG1, IgG2, IgG3, and IgG4; IgA1 and IgA2. The heavy chain constant regions that correspond to the different classes of immunoglobulins are called α, δ, ε, γ and μ, respectively. Of the various human immunoglobulin classes, only human IgG1, IgG2, IgG3 and IgM are known to activate complement.

The term “Fc protein” refers to any macromolecule that would be recognized as an Fc protein by those of skill in the art. An Fc protein generally corresponds to the Fc region (fragment crystallizable region) of an antibody, as known to those of skill in the art. Fc proteins share common properties including binding to one or more Fc receptors. Fc proteins corresponding to IgG, IgG and IgD antibodies comprise domains corresponding to the C_(H)2 and C_(H)3 domains of these antibodies. Fc proteins corresponding to IgM and IgE antibodies comprise domains corresponding to C_(H)2, C_(H)3 and C_(H)4 domains of these antibodies. In certain embodiments, the Fc proteins are glycosylated. In certain embodiments, the Fc proteins are dimers. In certain embodiments, the Fc proteins are homodimers. In certain embodiments, the Fc proteins are heterodimers. In certain embodiments, the dimers are linked via a disulfide bond. In certain embodiments, the dimers are linked by an amino acid or a peptide bridge. In certain embodiments, Fc proteins do not comprise a variable domain. In certain embodiments, Fc proteins do not comprise a light chain. In certain embodiments, Fc proteins do not comprise a variable domain or a light chain.

The term “conjugate” refers to any moiety that can be connected to a modified Fc protein. In some embodiments, the terms “conjugate” and “payload” are used interchangeably. A conjugate can be a small molecule or a macromolecule. In some embodiments, the conjugate is a bioactive molecule including but not limited to a protein, a peptide, a nucleic active or a hybrid thereof. In some embodiments, the conjugate is a polymer such as polyethylene glycol. In some embodiments, a conjugate is a therapeutic agent, including a commercially available drug. In some embodiments, a conjugate is a label that can recognize and bind to specific targets, such as a molecular payload that is harmful to target cells or a label useful for detection or diagnosis. In some embodiments, the conjugate is connected to an Fc protein via a linker. In some embodiments, the conjugate is directly connected to an Fc protein without a linker.

The term “variant protein sequence” refers to a protein sequence that has one or more residues that differ in amino acid identity from another similar protein sequence. Said similar protein sequence may be the natural wild type protein sequence, or another variant of the wild type sequence. Variants include proteins that have one or more amino acid insertions, deletions or substitutions. Variants also include proteins that have one or more post-translationally modified amino acids.

The term “parent antibody” refers to an antibody known to those of skill in the art that is modified according to the description provided herein. The modification can be physical, i.e., chemically or biochemically replacing or modifying one or more amino acids of the parent antibody to yield an antibody within the scope of the present description. The modification can also be conceptual, i.e., using the sequence of one or more polypeptide chains of the parent antibody to design an antibody comprising one or more site-specific non-natural amino acids according to the present description. Parent antibodies can be naturally occurring antibodies or antibodies designed or developed in a laboratory. Parent antibodies can also be artificial or engineered antibodies, e.g., chimeric or humanized antibodies.

The term “conservatively modified variant” refers to an Fc protein that differs from a related Fc protein by conservative substitutions in amino acid sequence. One of skill will recognize that individual substitutions, deletions or additions to a peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.

The following eight groups each contain amino acids that are conservative substitutions for one another:

-   -   1) Alanine (A), Glycine (G);     -   2) Aspartic acid (D), Glutamic acid (E);     -   3) Asparagine (N), Glutamine (Q);     -   4) Arginine (R), Lysine (K), Histidine (H);     -   5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V);     -   6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W);     -   7) Serine (S), Threonine (T); and     -   8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins:         Structures and Molecular Properties (W H Freeman & Co.; 2nd         edition (December 1993)

The terms “identical” or “identity,” in the context of two or more polypeptide sequences, refer to two or more sequences or subsequences that are the same. Sequences are “substantially identical” if they have a percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, optionally about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95% identity over a specified region), when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. The identity can exist over a region that is at least about 50 amino acids or nucleotides in length, or over a region that is 75-100 amino acids or nucleotides in length, or, where not specified, across the entire sequence or a polypeptide. In the case of antibodies, identity can be measured outside the variable CDRs. Optimal alignment of sequences for comparison can be conducted, including but not limited to, by the local homology algorithm of Smith and Waterman (1970) Adv. Appl. Math. 2:482c, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.); or by manual alignment and visual inspection (see, e.g., Ausubel et al., Current Protocols in Molecular Biology (1995 supplement)).

Examples of algorithms that are suitable for determining percent sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nuc. Acids Res. 25:3389-3402, and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) or 10, M=5, N=−4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands. The BLAST algorithm is typically performed with the “low complexity” filter turned off. In some embodiments, the BLAST algorithm is typically performed with the “low complexity” filter turned on.

The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and most preferably less than about 0.001.

The term “amino acid” refers to naturally occurring and non-naturally occurring amino acids, as well as amino acids such as proline, amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.

Naturally encoded amino acids are the proteinogenic amino acids known to those of skill in the art. They include the 20 common amino acids (alanine, arginine, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, and valine) and the less common pyrrolysine and selenocysteine. Naturally encoded amino acids include post-translational variants of the 22 naturally occurring amino acids such as prenylated amino acids, isoprenylated amino acids, myrisoylated amino acids, palmitoylated amino acids, N-linked glycosylated amino acids, O-linked glycosylated amino acids, phosphorylated amino acids and acylated amino acids.

The term “non-natural amino acid” refers to an amino acid that is not a proteinogenic amino acid, or a post-translationally modified variant thereof. In particular, the term refers to an amino acid that is not one of the 20 common amino acids or pyrrolysine or selenocysteine, or post-translationally modified variants thereof.

A “functional Releasing Factor 1 (RF1) protein” refers to RF1 that retains activity equal to or substantially similar to wild-type or unmodified RF1 protein. Functional RF1 activity can be tested, for example, by measuring the growth rate of bacteria expressing the modified RF1 protein, and comparing the growth rate to bacteria expressing wild-type or unmodified RF1. Functional RF1 activity can also be tested, for example, by the ability of the modified RF1 protein to reduce orthogonal tRNA incorporation of a nnAA at a specified position in an mRNA encoding a target protein, thereby increasing the amount of premature chain termination (i.e., increasing the amount of truncated protein).

An “attenuated Releasing Factor 1 (RF1) protein” refers to a modified RF1 that retains reduced activity relative to wild-type or unmodified RF1 protein. RF1 activity can be tested, for example, by measuring the growth rate of bacteria expressing the modified RF1 protein, and comparing the growth rate to bacteria expressing wild-type or unmodified RF1. RF1 activity can also be tested, for example, by the ability of the modified RF1 protein to reduce orthogonal tRNA incorporation of a nnAA at a specified position in an mRNA encoding a target protein, thereby increasing the amount of premature chain termination (i.e., increasing the amount of truncated protein). In some embodiments, the attenuated RF1 protein comprises transcriptional modifications; for example, the expression level of the RF1 protein (wild type or modified) can be reduced to achieve attenuation. The reduction can also achieved by using RNAi technologies. In some embodiments, the attenuated RF1 protein comprises translational modifications; for example, the amount of the synthesized RF1 protein (wild type or modified) can be reduced to achieve attenuation, e.g., by increasing the rate at which the protein is digested by protease via insertion of protease-specific sequence into the RF1 sequence.

Fc Proteins

Provided herein are modified Fc proteins comprising one or more non-natural amino acid residues at site-specific positions in the amino acid sequence of at least one polypeptide chain.

The modified Fc protein can share high sequence identity with any Fc protein recognized by those of skill in the art, i.e. a parent Fc protein. In some embodiments, a parent Fc protein is an Fc fragment from an immunoglobulin that can be isolated from a subject. In certain embodiments, the amino acid sequence of the Fc protein is identical to the amino acid sequence of the parent Fc protein, other than the non-natural amino acids at site-specific position. In further embodiments, the modified Fc protein provided herein can have one or more insertions, deletions or mutations relative to the parent Fc protein in addition to the one or more non-natural amino acids at the site-specific positions. In certain embodiments, the modified Fc protein provided herein can have a unique primary sequence, so long as it would be recognized as an Fc protein by those of skill in the art.

In certain embodiments, the Fc protein comprising one or more non-natural amino acid residues at site-specific positions has high sequence identity to any parent Fc protein described herein or known to those of skill in the art. In certain embodiments, the Fc protein is substantially identical to a parent Fc protein described herein. In certain embodiments, the Fc protein has about 60% identity, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95% identity to a parent Fc protein. In certain embodiments, the Fc protein has greater than about 65%, greater than about 70%, greater than about 75%, greater than about 80%, greater than about 85%, greater than about 90%, or greater than about 95% identity to a parent Fc protein. Fc proteins are substantially identical if at least one polypeptide chain of Fc protein has sequence identity to a corresponding Fc protein chain of another Fc protein. In certain embodiments, the Fc protein comprising one or more non-natural amino acid residues at site-specific positions has at least one polypeptide chain having greater than about 65%, greater than about 70%, greater than about 75%, greater than about 80%, greater than about 85%, greater than about 90%, or greater than about 95% identity to any of SEQ ID NO: 1 and 4-6 over at least one domain, at least two domains or at least three domains. In certain embodiments, percent identity is over at least one chain. In certain embodiments, percent identity is over at least two chains.

In some embodiments and similar to a parent Fc protein, the modified Fc protein can be a monomer or a dimer. In certain embodiments, the modified Fc protein is a dimer comprising polypeptides corresponding to one or more constant domains of an antibody. In some embodiments, Fc proteins corresponding to IgG, IgG and IgD antibodies comprise domains corresponding to the C_(H)2 and C_(H)3 domains of these antibodies. In some embodiments, Fc proteins corresponding to IgM and IgE antibodies comprise domains corresponding to C_(H)2, C_(H)3 and C_(H)4 domains of these antibodies. Each polypeptide chain can be linked to the other polypeptide chain by one or more covalent disulfide bonds. Each polypeptide chain can also have one or more intrachain disulfide bonds. In certain embodiments, the Fc proteins are glycosylated. In some embodiments, the linker sequence includes a single amino acid. In other embodiments, the linker sequence includes a peptide. In some embodiments, linker peptide sequences can be random linker sequences that offer structural flexibilities. In some embodiments, linker peptide sequences can be selected based on the structures of the individual peptide chains, by, for example, searching libraries of protein or peptide structures. In certain embodiments, a linker peptide sequence is selected to best join the structures of individual peptide chains of the dimer or multimer. In some embodiments, non-natural amino acids can be incorporated into the linker sequence. In some embodiments, the linker can include non-amino acids such as alkanes, lipid or fat molecules.

As is known to those of skill in the art, Fc proteins typically have binding affinity for Fc receptors in vivo.

The modified Fc proteins provided herein can have sequences that are similar or identical to those of any Fc protein form known to those of skill in the art. They can be full-length, or fragments. Exemplary full length Fc proteins correspond to the Fc domains of IgA, IgA1, IgA2, IgD, IgE, IgG, IgG1, IgG2, IgG3, IgG4 or IgM.

The modified Fc proteins provided herein comprise at least one non-natural amino acid. The non-natural amino acid can be any non-natural amino acid known to those of skill in the art. Exemplary non-natural amino acids are described in the sections below.

The non-natural amino acids are positioned at selected locations in a polypeptide chain of the parent Fc protein. These locations were identified as providing optimum sites for substitution with the non-natural amino acids. Each site is capable of bearing a non-natural amino acid with optimum structure, function and/or methods for producing the modified Fc protein.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that is more stable compared to an Fc protein without the site-specific non-natural amino acid. Stability can be measured by any technique apparent to those of skill in the art.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that is has optimal functional properties compared to an Fc protein without the site-specific non-natural amino acid. For instance, the modified Fc protein can show little or no loss of binding affinity for Fc receptor compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced binding compared to an Fc protein without the site-specific non-natural amino acid.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that can be made advantageously. For instance, in certain embodiments, the modified Fc protein shows advantageous properties in its methods of synthesis compared to an Fc protein without the site-specific non-natural amino acid, discussed below. In certain embodiments, the modified Fc protein can show little or no loss in yield in production compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced yield in production compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show little or no loss of tRNA suppression, described below, compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced tRNA suppression, described below, in production compared to an Fc protein without the site-specific non-natural amino acid.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that has advantageous solubility compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show little or no loss in solubility compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced solubility compared to an Fc protein without the site-specific non-natural amino acid.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that has advantageous expression compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show little or no loss in expression compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced expression compared to an Fc protein without the site-specific non-natural amino acid.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that has advantageous folding compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show little or no loss in proper folding compared to an Fc protein without the site-specific non-natural amino acid. In certain embodiments, the modified Fc protein can show enhanced folding compared to an Fc protein without the site-specific non-natural amino acid.

In certain embodiments, a site-specific position for substitution provides a modified Fc protein that is capable of advantageous conjugation compared to an Fc protein without the site-specific non-natural amino acid. As described below, several non-natural amino acids have side chains or functional groups that facilitate conjugation of the modified Fc protein to a second agent, either directly or via a linker. In certain embodiments, the modified Fc protein can show enhanced conjugation efficiency compared to an Fc protein without the same or other non-natural amino acids at other positions. In certain embodiments, the modified Fc protein can show enhanced conjugation yield compared to an Fc protein without the same or other non-natural amino acids at other positions. In certain embodiments, the modified Fc protein can show enhanced conjugation specificity compared to an Fc protein without the same or other non-natural amino acids at other positions.

The one or more non-natural amino acids are located at selected site-specific positions in at least one polypeptide chain of the modified Fc protein. If the modified Fc protein is a dimer or multimer, at least one non-natural amino acid can be in either polypeptide of the dimer. Further, at least one non-natural amino acid can be in any domain of a polypeptide of an Fc protein. Of course, a modified Fc protein can comprise a plurality of non-natural amino acids at site-specific positions, in any domain and/or in any polypeptide.

In certain embodiments, the modified Fc proteins provided herein comprise one non-natural amino acid at a site-specific position. In certain embodiments, the modified Fc proteins provided herein comprise two non-natural amino acids at site-specific positions. In certain embodiments, the modified Fc proteins provided herein comprise three non-natural amino acids at site-specific positions. In certain embodiments, the modified Fc proteins provided herein comprise more than three non-natural amino acids at site-specific positions.

The site-specific positions for substituting can be described according to any Fc protein nomenclature system known to those of skill in the art. In the EU numbering system, these positions are at heavy chain residues H238, H239, H241, H243, H246, H262, H264, H265, H267, H268, H269, H270, H271, H272, H274, H275, H278, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H356, H358, H359, H360, H375, H383, H384, H386, H389, H392, H398, H404, H420, H421, H436, and H438. In other words, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H238, H239, H241, H243, H246, H262, H264, H265, H267, H268, H269, H270, H271, H272, H274, H275, H278, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H356, H358, H359, H360, H375, H383, H384, H386, H389, H392, H398, H404, H420, H421, H436, and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H239, H241, H246, H267, H268, H269, H270, H271, H272, H274, H275, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H359, H375, H386, H389, H392, H398, H404, H420, H421, and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H239, H293, H334, H355, H359, and H389.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H241, H246, H267, H268, H269, H270, H271, H272, H274, H275, H280, H281, H282, H283, H286, H289, H292, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H335, H337, H339, H342, H344, H355, H375, H386, H392, H398, H420, H421, H340, H404, and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H238, H243, H262, H264, H265, H278, H342, H356, H358, H360, H383, H384, H404 and H436.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues corresponding to H292-H301, H303, and H305.

In the EU numbering system, these positions are at heavy chain residues H238, H239, H241, H243, H246, H262, H264, H265, H267, H268, H269, H270, H271, H272, H274, H275, H278, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H356, H358, H359, H360, H375, H383, H384, H386, H389, H392, H398, H404, H420, H421, H436, and H438. In other words, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H238, H239, H241, H243, H246, H262, H264, H265, H267, H268, H269, H270, H271, H272, H274, H275, H278, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H356, H358, H359, H360, H375, H383, H384, H386, H389, H392, H398, H404, H420, H421, H436, and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H239, H241, H246, H267, H268, H269, H270, H271, H272, H274, H275, H280, H281, H282, H283, H286, H289, H292, H293, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H334, H335, H337, H339, H340, H342, H344, H355, H359, H375, H386, H389, H392, H398, H404, H420, H421, and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H239, H293, H334, H342, H355, H359, and H389.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues corresponding to H292-H301, H303, and H305.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H241, H246, H267, H268, H269, H270, H271, H272, H274, H275, H280, H281, H282, H283, H286, H289, H292, H294, H295, H296, H297, H298, H299, H300, H301, H303, H305, H317, H320, H324, H326, H327, H329, H330, H332, H333, H335, H337, H339, H342, H344, H355, H375, H386, H389, H392, H398, H404, H420, H421, H340 and H438.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from EU residues H238, H243, H262, H264, H265, H278, H342, H356, H358, H360, H383, H384, H389, H404, and H436.

The site-specific positions can also be identified relative to the amino acid sequences of the polypeptide chains of a reference Fc protein. For example, the amino acid sequence of a reference heavy chain is provided in SEQ ID NO: 1. In the exemplary reference heavy chain, the site-specific positions are at residues 241, 242, 244, 246, 249, 265, 267, 268, 270, 271, 272, 273, 274, 275, 277, 278, 281, 283, 284, 285, 286, 289, 292, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 306, 308, 320, 323, 327, 329, 330, 332, 333, 335, 336, 337, 338, 340, 342, 343, 345, 347, 358, 359, 361, 362, 363, 378, 386, 387, 389, 392, 395, 401, 407, 423, 424, 439 and 441. In other words, provided herein are Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 241, 242, 244, 246, 249, 265, 267, 268, 270, 271, 272, 273, 274, 275, 277, 278, 281, 283, 284, 285, 286, 289, 292, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 306, 308, 320, 323, 327, 329, 330, 332, 333, 335, 336, 337, 338, 340, 342, 343, 345, 347, 358, 359, 361, 362, 363, 378, 386, 387, 389, 392, 395, 401, 407, 423, 424, 439 and 441 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 242, 244, 249, 270, 271, 272, 273, 274, 275, 277, 278, 283, 284, 285, 286, 289, 292, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 306, 308, 320, 323, 327, 329, 330, 332, 333, 335, 336, 337, 338, 340, 342, 343, 345, 347, 358, 362, 378, 389, 392, 395, 401, 407, 423, 424, and 441 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 242, 296, 337, 345, 358, 362, and 392 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 244, 249, 270, 271, 272, 273, 274, 275, 277, 278, 283, 284, 285, 286, 289, 292, 295, 297, 298, 299, 300, 301, 302, 303, 304, 306, 308, 320, 323, 327, 329, 330, 332, 333, 335, 336, 338, 340, 342, 343, 345, 347, 358, 378, 389, 395, 401, 407, 423, 424, and 441 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 241, 246, 265, 267, 268, 281, 345, 359, 361, 363, 386, 387, 407, and 439 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 242, 244, 270, 273, 274, 275, 285, 289, 295, 296, 299, 300, 301, 332, 333, 337, 338, 343, 345, 358, 362, 389, 407, 423, 424, and 441 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to residues 292-301, 303, and 305 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 222, 285, 292, 299, 333, 338, 364, 403, 407, 425, 443, 263, 270, 271, 275, 277, 295, 296, 300, 301, 306, 308, 335, 336, 337, 343, 344, 345, 346, 358, 365, 389, 395, 407, 427, 441, 445, and 446 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 222, 285, 299, 338, 364, 425, 443, 270, 275, 277, 295, 296, 300, 301, 306, 308, 337, 343, 344, 345, 346, 358, 365, 395, 407, 427, 441, 445, and 446 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 222, 285, 299, 364, 425, 443, 270, 275, 296, 300, 301, 306, 308, 337, 343, 344, 345, 358, 365, 395, 407, 427, 441, 445, and 446 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 222, 285, 299, 425, 443, 270, 275, 296, 300, 301, 306, 308, 337, 343, 344, 345, 358, 395, 407, 427, 441, 445, and 446 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 222, 285, 299, 270, 296, 300, 301, 306, 308, 337, 343, 344, 395, 407, and 441 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 224, 225, 228, 230, 233, 234, 235 and 239 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 224, 225, 228, 230, 233, 234, 235 and 239 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 230, 233, 234, 235, and 239 of the representative heavy chain polypeptide according to SEQ ID NO:1.

In certain embodiments, provided herein are modified Fc proteins comprising one or more non-natural amino acids at one or more positions selected from those corresponding to 224, and 225 of the representative heavy chain polypeptide according to SEQ ID NO: 1.

In certain embodiments, the modified Fc protein comprises a polypeptide chain that can be described by the following formula (I):

Xaa¹- . . . -(Naa^(p(i)))^(n)- . . . -Xaa^(Q)  (I).

In Formula (I), each Xaa represents an amino acid in the polypeptide chain of any identity. In other words, each Xaa can be any amino acid, typically any naturally occurring amino acid, or a variant thereof. The superscript to the right of each Xaa represents the position of the amino acid within the primary sequence of the polypeptide chain. Xaa¹ represents the first, or N-terminal, amino acid in the polypeptide chain, and Xaa^(q) represents the last, or C-terminal, amino acid in the polypeptide chain. The variable q is an integer that is equal to the total number of amino acids in the polypeptide chain. Each Naa represents a non-natural amino acid within the polypeptide chain. Useful non-natural amino acids are described in the sections below. The integer n represents the number of non-natural amino acids in the polypeptide chain. In typical embodiments, n is an integer greater than 1. Each integer p(i) is greater than 1 and less than q, and the variable i is an integer that varies from 1 to n. Each integer p(i) represents a site-specific location in the amino acid sequence for the corresponding Naa. Each site specific location p(i) is optimal for substitution of a naturally occurring amino acid with a non-natural amino acid, such as Naa^(p(i)), according to the techniques described herein.

Analysis of the data from TAG screening as presented in the Examples allowed selection of sites which are ideal candidates for making Fc protein conjugates on the basis of cell killing, expression levels, drug-to-antibody ratio, solvent accessibility and (where available) thermal stability. The most preferred sites have lower solvent accessibility, and higher thermostability than the other tested variants. The preferred sites for nnAA incorporation are listed in Table I, below.

TABLE I Preferred Sites for nnAA Incorporation Site of nnAA Preference HC-F404 Most Preferred HC-F241 Most Preferred HC-K222 More Preferred

Accordingly, in certain embodiments, provided herein are antibodies comprising one or more non-natural amino acids at one or more positions selected from the group consisting of heavy chain or light chain residues HC-F404, HC-F241 and HC-K222 according to the EU numbering scheme, or a post-translationally modified variant thereof.

In certain embodiments, provided herein are antibodies comprising one or more non-natural amino acids at one or more positions selected from the group consisting of heavy chain or light chain residues ----- according to the EU numbering scheme, or a post-translationally modified variant thereof.

In certain embodiments, provided herein are antibodies comprising one or more non-natural amino acids at one or more positions selected from the group consisting of heavy chain or light chain residues ------ according to the EU numbering scheme, or a post-translationally modified variant thereof.

In some embodiments, modified Fc proteins provided herein can be further conservatively modified using methods known in the art. Conservatively modified variants of an Fc protein include one or more insertions, deletions or substitutions that do not disrupt the structure and/or function of the Fc protein when evaluated by one of skill in the art. In certain embodiments, conservatively modified variants include 20 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 15 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 10 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 9 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 8 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 7 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 6 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 5 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 4 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 3 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 2 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, conservatively modified variants include 1 amino acid insertion, deletion or substitution. In particular embodiments the substitutions are conservative, substituting an amino acid within the same class, as described above.

In certain embodiments, the modified Fc proteins can be further modified to modulate structure, stability and/or activity. In such embodiments, the modifications can be conservative or other than conservative. The modifications need only be suitable to the practitioner carrying out the methods and using the compositions described herein. In certain embodiments, the modifications decrease but do not eliminate antigen binding affinity. In certain embodiments, the modifications increase antigen binding affinity. In certain embodiments, the modifications enhance structure or stability of the Fc protein. In certain embodiments, the modifications reduce but do not eliminate structure or stability of the Fc protein. In certain embodiments, modified variants include 20 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 15 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 10 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 9 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 8 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 7 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 6 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 5 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 4 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 3 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 2 or fewer amino acid insertions, deletions or substitutions. In certain embodiments, modified variants include 1 amino acid insertion, deletion or substitution.

Also within the scope are post-translationally modified variants of modified Fc proteins. Any of the modified Fc proteins provided herein can be post-translationally modified in any manner recognized by those of skill in the art. Typical post-translational modifications for Fc proteins include interchain disulfide bonding, intrachain disulfide bonding, N-linked glycosylation and proteolysis. Also provided herein are other post-translationally modified Fc proteins having modifications such as phosphorylation, O-linked glycosylation, methylation, acetylation, lipidation, GPI anchoring, myristoylation and prenylation. The post-translational modification can occur during production, in vivo, in vitro or otherwise. In certain embodiments, the post-translational modification can be an intentional modification by a practitioner, for instance, using the methods provided herein.

Further included within the scope are modified Fc proteins fused to further peptides or polypeptides. Exemplary fusions include, but are not limited to, e.g., a methionyl Fc protein in which a methionine is linked to the N-terminus of the Fc protein resulting from the recombinant expression, fusions for the purpose of purification (including but not limited to, to poly-histidine or affinity epitopes), fusions for the purpose of linking to other biologically active molecules, fusions with serum albumin binding peptides, and fusions with serum proteins such as serum albumin. The modified Fc proteins may comprise protease cleavage sequences, reactive groups, Fc protein-binding domains (including but not limited to, FLAG or poly-His) or other affinity based sequences (including but not limited to, FLAG, poly-His, GST, etc.). The modified Fc proteins may also comprise linked molecules (including but not limited to, biotin) that improve detection (including but not limited to, GFP), purification or other features of the modified Fc protein. In certain embodiments, the modified Fc proteins comprise a C-terminal affinity sequence that facilitates purification of full length Fc proteins. In certain embodiments, such C-terminal affinity sequence is a poly-His sequence, e.g., a 6-His sequence.

The modified Fc protein can have any Fc protein form recognized by those of skill in the art. The Fc protein can comprise a single polypeptide chain. The modified Fc protein can also be in the form of a multimer that will be recognized by those of skill in the art including homodimers, heterodimers, homomultimers, and heteromultimers. These multimers can be linked or unlinked. Useful linkages include interchain disulfide bonds typical for modified Fc domains. The multimers can also be linked by other amino acids, including the non-natural amino acids introduced according to the present description. The modified Fc protein can be based on a parent immunoglobulin such as of any class or subclass including IgA, IgA1, IgA2, IgD, IgE, IgG, IgG1, IgG2, IgG3, IgG4 and IgM. In some embodiments, individual peptide chains of a modified Fc protein dimer or multimer can be joined by a linker sequence to form a single peptide chain. In some embodiments, the linker sequence includes a single amino acid. In other embodiments, the linker sequence includes a peptide. In some embodiments, linker peptide sequences can be random linker sequences that offer structure flexibilities. In some embodiments, linker peptide sequences can be selected based on the structures of the individual peptide chains, by, for example, searching libraries of protein or peptide structures. A linker peptide sequence is selected to best join the structures of individual peptide chains of the dimer or multimer. In some embodiments, non-natural amino acids can be incorporated into the linker sequence as well. In some embodiments, the linker can include non-amino acids such as alkanes, lipid or fat molecules. In some embodiments, a single peptide chain bearing the linker sequence can be synthesized using synthetic methods. In some embodiments, a nucleotide sequence encoding the single peptide chain can be expressed in a heterologous expression system to produce the desired dimer or multimer.

The modified Fc protein may further be glycosylated or aglycosyated. Accordingly, in certain embodiments, the modified Fc protein is glycosylated. In certain embodiments, the modified Fc protein is aglycosylated. Agylcosylated Fc proteins comprising a non-natural amino acid may be made, for example, as described in the Examples provided herein or through bacterial expression systems known to those skilled in the art. Glycosylated Fc proteins comprising a non-natural amino acid may be made, for example, as described by Axup et al., 2012, Proc. Nat. Acad. Sci. USA 109(40):16101-16106.

Also provided herein are modified Fc proteins that are conjugated to one or more conjugation moieties. The conjugation moiety can be any conjugation moiety deemed useful to one of skill in the art. For instance, the conjugation moiety can be a polymer, such as polyethylene glycol, that can improve the stability of the modified Fc protein in vitro or in vivo. The conjugation moiety can have therapeutic activity, thereby yielding an Fc protein-drug conjugate. The conjugation moiety can be a molecular payload that is harmful to target cells. The conjugation moiety can be a label useful for detection or diagnosis. In certain embodiments, the conjugation moiety is linked to the modified Fc protein via a direct covalent bond. In certain embodiments, the conjugation moiety is linked to the modified Fc protein via a linker. In advantageous embodiments, the conjugation moiety or the linker is attached via one of the non-natural amino acids of the modified Fc protein. Exemplary conjugation moieties and linkers are discussed in the sections below.

Non-Natural Amino Acids

The non-natural amino acid can be any non-natural amino acid known to those of skill in the art. In some embodiments, the non-naturally encoded amino acid comprises a functional group. The functional group can be any functional group known to those of skill in the art. In certain embodiments the functional group is a label, a polar group, a non-polar group or a reactive group.

Reactive groups are particularly advantageous for linking further functional groups to the Fc protein at the site-specific position of the Fc protein chain. In certain embodiments, the reactive group is selected from the group consisting of amino, carboxy, acetyl, hydrazino, hydrazido, semicarbazido, sulfanyl, azido and alkynyl.

In certain embodiments, the amino acid residue is according to any of the following formulas:

Those of skill in the art will recognize that Fc proteins are generally comprised of L-amino acids. However, with non-natural amino acids, the present methods and compositions provide the practitioner with the ability to use L-, D- or racemic non-natural amino acids at the site-specific positions. In certain embodiments, the non-natural amino acids described herein include D-versions of the natural amino acids and racemic versions of the natural amino acids.

In the above formulas, the wavy lines indicate bonds that connect to the remainder of the polypeptide chains of the Fc proteins. These non-natural amino acids can be incorporated into polypeptide chains just as natural amino acids are incorporated into the same polypeptide chains. In certain embodiments, the non-natural amino acids are incorporated into the polypeptide chain via amide bonds as indicated in the formulas.

In the above formulas R designates any functional group without limitation, so long as the amino acid residue is not identical to a natural amino acid residue. In certain embodiments, R can be a hydrophobic group, a hydrophilic group, a polar group, an acidic group, a basic group, a chelating group, a reactive group, a therapeutic moiety or a labeling moiety. In certain embodiments, R is selected from the group consisting of R¹NR²R³, R¹C(═O)R², R¹C(═O)OR², R¹N₃, R¹C(≡CH). In these embodiments, R¹ is selected from the group consisting of a bond, alkylene, heteroalkylene, arylene, heteroarylene. R² and R³ are each independently selected from the group consisting of hydrogen, alkyl and heteroalkyl.

In some embodiments, the non-naturally encoded amino acids include side chain functional groups that react efficiently and selectively with functional groups not found in the 20 common amino acids (including but not limited to, azido, ketone, aldehyde and aminooxy groups) to form stable conjugates. For example, antigen-binding polypeptide that includes a non-naturally encoded amino acid containing an azido functional group can be reacted with a polymer (including but not limited to, poly(ethylene glycol) or, alternatively, a second polypeptide containing an alkyne moiety to form a stable conjugate resulting for the selective reaction of the azide and the alkyne functional groups to form a Huisgen [3+2] cycloaddition product.

Exemplary non-naturally encoded amino acids that may be suitable for use in the present invention and that are useful for reactions with water soluble polymers include, but are not limited to, those with carbonyl, aminooxy, hydrazine, hydrazide, semicarbazide, azide and alkyne reactive groups. In some embodiments, non-naturally encoded amino acids comprise a saccharide moiety. Examples of such amino acids include N-acetyl-L-glucosaminyl-L-serine, N-acetyl-L-galactosaminyl-L-serine, N-acetyl-L-glucosaminyl-L-threonine, N-acetyl-L-glucosaminyl-L-asparagine and O-mannosaminyl-L-serine. Examples of such amino acids also include examples where the naturally-occurring N- or O-linkage between the amino acid and the saccharide is replaced by a covalent linkage not commonly found in nature-including but not limited to, an alkene, an oxime, a thioether, an amide and the like. Examples of such amino acids also include saccharides that are not commonly found in naturally-occurring proteins such as 2-deoxy-glucose, 2-deoxygalactose and the like.

Many of the non-naturally encoded amino acids provided herein are commercially available, e.g., from Sigma-Aldrich (St. Louis, Mo., USA), Novabiochem (a division of EMD Biosciences, Darmstadt, Germany), or Peptech (Burlington, Mass., USA). Those that are not commercially available are optionally synthesized as provided herein or using standard methods known to those of skill in the art. For organic synthesis techniques, see, e.g., Organic Chemistry by Fessendon and Fessendon, (1982, Second Edition, Willard Grant Press, Boston Mass.); Advanced Organic Chemistry by March (Third Edition, 1985, Wiley and Sons, New York); and Advanced Organic Chemistry by Carey and Sundberg (Third Edition, Parts A and B, 1990, Plenum Press, New York). See, also, U.S. Patent Application Publications 2003/0082575 and 2003/0108885, which is incorporated by reference herein. In addition to unnatural amino acids that contain novel side chains, unnatural amino acids that may be suitable for use in the present invention also optionally comprise modified backbone structures, including but not limited to, as illustrated by the structures of Formula II and III:

wherein Z typically comprises OH, NH₂, SH, NH—R′, or S—R′; X and Y, which can be the same or different, typically comprise S or O, and R and R′, which are optionally the same or different, are typically selected from the same list of constituents for the R group described above for the unnatural amino acids having Formula I as well as hydrogen. For example, unnatural amino acids of the invention optionally comprise substitutions in the amino or carboxyl group as illustrated by Formulas II and III. Unnatural amino acids of this type include, but are not limited to, α-hydroxy acids, α-thioacids, α-aminothiocarboxylates, including but not limited to, with side chains corresponding to the common twenty natural amino acids or unnatural side chains. In addition, substitutions at the α-carbon optionally include, but are not limited to, L, D, or α-α-disubstituted amino acids such as D-glutamate, D-alanine, D-methyl-O-tyrosine, aminobutyric acid, and the like. Other structural alternatives include cyclic amino acids, such as proline analogues as well as 3, 4, 6, 7, 8, and 9 membered ring proline analogues, P and y amino acids such as substituted β-alanine and γ-amino butyric acid.

Many unnatural amino acids are based on natural amino acids, such as tyrosine, glutamine, phenylalanine, and the like, and are suitable for use in the present invention. Tyrosine analogs include, but are not limited to, para-substituted tyrosines, ortho-substituted tyrosines, and meta substituted tyrosines, where the substituted tyrosine comprises, including but not limited to, a keto group (including but not limited to, an acetyl group), a benzoyl group, an amino group, a hydrazine, an hydroxyamine, a thiol group, a carboxy group, an isopropyl group, a methyl group, a C₆-C₂₀ straight chain or branched hydrocarbon, a saturated or unsaturated hydrocarbon, an O-methyl group, a polyether group, a nitro group, an alkynyl group or the like. In addition, multiply substituted aryl rings are also contemplated. Glutamine analogs that may be suitable for use in the present invention include, but are not limited to, α-hydroxy derivatives, γ-substituted derivatives, cyclic derivatives, and amide substituted glutamine derivatives. Example phenylalanine analogs that may be suitable for use in the present invention include, but are not limited to, para-substituted phenylalanines, ortho-substituted phenyalanines, and meta-substituted phenylalanines, where the substituent comprises, including but not limited to, a hydroxy group, a methoxy group, a methyl group, an allyl group, an aldehyde, an azido, an iodo, a bromo, a keto group (including but not limited to, an acetyl group), a benzoyl, an alkynyl group, or the like. Specific examples of unnatural amino acids that may be suitable for use in the present invention include, but are not limited to, a p-acetyl-L-phenylalanine, an O-methyl-L-tyrosine, an L-3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p-acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L-phenylalanine, an isopropyl-L-phenylalanine, and a p-propargyloxy-phenylalanine, and the like. Examples of structures of a variety of unnatural amino acids that may be suitable for use in the present invention are provided in, for example, WO 2002/085923 entitled “In vivo incorporation of unnatural amino acids.” See also Kiick et al., (2002) Incorporation of azides into recombinant proteins for chemoselective modification by the Staudinger ligation, PNAS 99:19-24, for additional methionine analogs.

Many of the unnatural amino acids suitable for use in the present invention are commercially available, e.g., from Sigma (USA) or Aldrich (Milwaukee, Wis., USA). Those that are not commercially available are optionally synthesized as provided herein or as provided in various publications or using standard methods known to those of skill in the art. For organic synthesis techniques, see, e.g., Organic Chemistry by Fessendon and Fessendon, (1982, Second Edition, Willard Grant Press, Boston Mass.); Advanced Organic Chemistry by March (Third Edition, 1985, Wiley and Sons, New York); and Advanced Organic Chemistry by Carey and Sundberg (Third Edition, Parts A and B, 1990, Plenum Press, New York). Additional publications describing the synthesis of unnatural amino acids include, e.g., WO 2002/085923 entitled “In vivo incorporation of Unnatural Amino Acids;” Matsoukas et al., (1995) J. Med. Chem., 38, 4660-4669; King, F. E. & Kidd, D. A. A. (1949) A New Synthesis of Glutamine and of γ-Dipeptides of Glutamic Acid from Phthylated Intermediates. J. Chem. Soc., 3315-3319; Friedman, O. M. & Chatterrji, R. (1959) Synthesis of Derivatives of Glutamine as Model Substrates for Anti-Tumor Agents. J. Am. Chem. Soc. 81, 3750-3752; Craig, J. C. et al. (1988) Absolute Configuration of the Enantiomers of 7-Chloro-4 [[4-(diethylamino)-1-methylbutyl]amino]quinoline (Chloroquine). J. Org. Chem. 53, 1167-1170; Azoulay, M., Vilmont, M. & Frappier, F. (1991) Glutamine analogues as Potential Antimalarials, Eur. J. Med. Chem. 26, 201-5; Koskinen, A. M. P. & Rapoport, H. (1989) Synthesis of 4-Substituted Prolines as Conformationally Constrained Amino Acid Analogues. J. Org. Chem. 54, 1859-1866; Christie, B. D. & Rapoport, H. (1985) Synthesis of Optically Pure Pipecolates from L-Asparagine. Application to the Total Synthesis of (+)-Apovincamine through Amino Acid Decarbonylation and Iminium Ion Cyclization. J. Org. Chem. 1989:1859-1866; Barton et al., (1987) Synthesis of Novel a-Amino-Acids and Derivatives Using Radical Chemistry: Synthesis of L- and D-a-Amino-Adipic Acids, L-a-aminopimelic Acid and Appropriate Unsaturated Derivatives. Tetrahedron Lett. 43:4297-4308; and, Subasinghe et al., (1992) Quisqualic acid analogues: synthesis of beta-heterocyclic 2-aminopropanoic acid derivatives and their activity at a novel quisqualate-sensitized site. J. Med. Chem. 35:4602-7. See also, patent applications entitled “Protein Arrays,” filed Dec. 22, 2003, Ser. No. 10/744,899 and Ser. No. 60/435,821 filed on Dec. 22, 2002.

Amino acids with a carbonyl reactive group allow for a variety of reactions to link molecules (including but not limited to, PEG or other water soluble molecules) via nucleophilic addition or aldol condensation reactions among others.

Exemplary carbonyl-containing amino acids can be represented as follows:

wherein n is 0-10; R₁ is an alkyl, aryl, substituted alkyl, or substituted aryl; R₂ is H, alkyl, aryl, substituted alkyl, and substituted aryl; and R₃ is H, an amino acid, a polypeptide, or an amino terminus modification group, and R₄ is H, an amino acid, a polypeptide, or a carboxy terminus modification group. In some embodiments, n is 1, R₁ is phenyl and R₂ is a simple alkyl (i.e., methyl, ethyl, or propyl) and the ketone moiety is positioned in the para position relative to the alkyl side chain. In some embodiments, n is 1, R₁ is phenyl and R₂ is a simple alkyl (i.e., methyl, ethyl, or propyl) and the ketone moiety is positioned in the meta position relative to the alkyl side chain.

In the present invention, a non-naturally encoded amino acid bearing adjacent hydroxyl and amino groups can be incorporated into the polypeptide as a “masked” aldehyde functionality. For example, 5-hydroxylysine bears a hydroxyl group adjacent to the epsilon amine. Reaction conditions for generating the aldehyde typically involve addition of molar excess of sodium metaperiodate under mild conditions to avoid oxidation at other sites within the polypeptide. The pH of the oxidation reaction is typically about 7.0. A typical reaction involves the addition of about 1.5 molar excess of sodium meta periodate to a buffered solution of the polypeptide, followed by incubation for about 10 minutes in the dark. See, e.g. U.S. Pat. No. 6,423,685, which is incorporated by reference herein.

The carbonyl functionality can be reacted selectively with a hydrazine-, hydrazide-, hydroxylamine-, or semicarbazide-containing reagent under mild conditions in aqueous solution to form the corresponding hydrazone, oxime, or semicarbazone linkages, respectively, that are stable under physiological conditions. See, e.g., Jencks, W. P., J. Am. Chem. Soc. 81, 475-481 (1959); Shao, J. and Tam, J. P., J. Am. Chem. Soc. 117:3893-3899 (1995). Moreover, the unique reactivity of the carbonyl group allows for selective modification in the presence of the other amino acid side chains. See, e.g., Cornish, V. W., et al., J. Am. Chem. Soc. 118:8150-8151 (1996); Geoghegan, K. F. & Stroh, J. G., Bioconjug. Chem. 3:138-146 (1992); Mahal, L. K., et al., Science 276:1125-1128 (1997).

Non-naturally encoded amino acids containing a nucleophilic group, such as a hydrazine, hydrazide or semicarbazide, allow for reaction with a variety of electrophilic groups to form conjugates (including but not limited to, with PEG or other water soluble polymers).

Exemplary hydrazine, hydrazide or semicarbazide-containing amino acids can be represented as follows:

wherein n is 0-10; R₁ is an alkyl, aryl, substituted alkyl, or substituted aryl or not present; X, is O, N, or S or not present; R₂ is H, an amino acid, a polypeptide, or an amino terminus modification group, and R₃ is H, an amino acid, a polypeptide, or a carboxy terminus modification group.

In some embodiments, n is 4, R₁ is not present, and X is N. In some embodiments, n is 2, R₁ is not present, and X is not present. In some embodiments, n is 1, R₁ is phenyl, X is O, and the oxygen atom is positioned para to the alphatic group on the aryl ring.

Hydrazide-, hydrazine-, and semicarbazide-containing amino acids are available from commercial sources. For instance, L-glutamate-γ-hydrazide is available from Sigma Chemical (St. Louis, Mo.). Other amino acids not available commercially can be prepared by one skilled in the art. See, e.g., U.S. Pat. No. 6,281,211, which is incorporated by reference herein.

Polypeptides containing non-naturally encoded amino acids that bear hydrazide, hydrazine or semicarbazide functionalities can be reacted efficiently and selectively with a variety of molecules that contain aldehydes or other functional groups with similar chemical reactivity. See, e.g., Shao, J. and Tam, J., J. Am. Chem. Soc. 117:3893-3899 (1995). The unique reactivity of hydrazide, hydrazine and semicarbazide functional groups makes them significantly more reactive toward aldehydes, ketones and other electrophilic groups as compared to the nucleophilic groups present on the 20 common amino acids (including but not limited to, the hydroxyl group of serine or threonine or the amino groups of lysine and the N-terminus).

Non-naturally encoded amino acids containing an aminooxy (also called a hydroxylamine) group allow for reaction with a variety of electrophilic groups to form conjugates (including but not limited to, with PEG or other water soluble polymers). Like hydrazines, hydrazides and semicarbazides, the enhanced nucleophilicity of the aminooxy group permits it to react efficiently and selectively with a variety of molecules that contain aldehydes or other functional groups with similar chemical reactivity. See, e.g., Shao, J. and Tam, J., J. Am. Chem. Soc. 117:3893-3899 (1995); H. Hang and C. Bertozzi, Acc. Chem. Res. 34: 727-736 (2001). Whereas the result of reaction with a hydrazine group is the corresponding hydrazone, however, an oxime results generally from the reaction of an aminooxy group with a carbonyl-containing group such as a ketone.

Exemplary amino acids containing aminooxy groups can be represented as follows:

wherein n is 0-10; R₁ is an alkyl, aryl, substituted alkyl, or substituted aryl or not present; X is O, N, S or not present; m is 0-10; Y═C(O) or not present; R₂ is H, an amino acid, a polypeptide, or an amino terminus modification group, and R₃ is H, an amino acid, a polypeptide, or a carboxy terminus modification group. In some embodiments, n is 1, R₁ is phenyl, X is O, m is 1, and Y is present. In some embodiments, n is 2, R₁ and X are not present, m is 0, and Y is not present.

Aminooxy-containing amino acids can be prepared from readily available amino acid precursors (homoserine, serine and threonine). See, e.g., M. Carrasco and R. Brown, J. Org. Chem. 68: 8853-8858 (2003). Certain aminooxy-containing amino acids, such as L-2-amino-4-(aminooxy)butyric acid), have been isolated from natural sources (Rosenthal, G. et al., Life Sci. 60: 1635-1641 (1997). Other aminooxy-containing amino acids can be prepared by one skilled in the art.

The unique reactivity of azide and alkyne functional groups makes them extremely useful for the selective modification of polypeptides and other biological molecules. Organic azides, particularly alphatic azides, and alkynes are generally stable toward common reactive chemical conditions. In particular, both the azide and the alkyne functional groups are inert toward the side chains (i.e., R groups) of the 20 common amino acids found in naturally-occurring polypeptides. When brought into close proximity, however, the “spring-loaded” nature of the azide and alkyne groups is revealed and they react selectively and efficiently via Huisgen [3+2] cycloaddition reaction to generate the corresponding triazole. See, e.g., Chin J., et al., Science 301:964-7 (2003); Wang, Q., et al., J. Am. Chem. Soc. 125, 3192-3193 (2003); Chin, J. W., et al., J. Am. Chem. Soc. 124:9026-9027 (2002).

Because the Huisgen cycloaddition reaction involves a selective cycloaddition reaction (see, e.g., Padwa, A., in COMPREHENSIVE ORGANIC SYNTHESIS, Vol. 4, (ed. Trost, B. M., 1991), p. 1069-1109; Huisgen, R. in 1,3-DIPOLAR CYCLOADDITION CHEMISTRY, (ed. Padwa, A., 1984), p. 1-176) rather than a nucleophilic substitution, the incorporation of non-naturally encoded amino acids bearing azide and alkyne-containing side chains permits the resultant polypeptides to be modified selectively at the position of the non-naturally encoded amino acid. Cycloaddition reaction involving azide or alkyne-containing antibody can be carried out at room temperature under aqueous conditions by the addition of Cu(II) (including but not limited to, in the form of a catalytic amount of CuSO₄) in the presence of a reducing agent for reducing Cu(II) to Cu(I), in situ, in catalytic amount. See, e.g., Wang, Q., et al., J. Am. Chem. Soc. 125, 3192-3193 (2003); Tornoe, C. W., et al., J. Org. Chem. 67:3057-3064 (2002); Rostovtsev, et al., Angew. Chem. Int. Ed. 41:2596-2599 (2002). Exemplary reducing agents include, including but not limited to, ascorbate, metallic copper, quinine, hydroquinone, vitamin K, glutathione, cysteine, Fe²⁺, Co²⁺, and an applied electric potential.

In some cases, where a Huisgen [3+2] cycloaddition reaction between an azide and an alkyne is desired, the antigen-binding polypeptide comprises a non-naturally encoded amino acid comprising an alkyne moiety and the water soluble polymer to be attached to the amino acid comprises an azide moiety. Alternatively, the converse reaction (i.e., with the azide moiety on the amino acid and the alkyne moiety present on the water soluble polymer) can also be performed.

The azide functional group can also be reacted selectively with a water soluble polymer containing an aryl ester and appropriately functionalized with an aryl phosphine moiety to generate an amide linkage. The aryl phosphine group reduces the azide in situ and the resulting amine then reacts efficiently with a proximal ester linkage to generate the corresponding amide. See, e.g., E. Saxon and C. Bertozzi, Science 287, 2007-2010 (2000). The azide-containing amino acid can be either an alkyl azide (including but not limited to, 2-amino-6-azido-1-hexanoic acid) or an aryl azide (p-azido-phenylalanine).

Exemplary water soluble polymers containing an aryl ester and a phosphine moiety can be represented as follows:

wherein X can be O, N, S or not present, Ph is phenyl, W is a water soluble polymer and R can be H, alkyl, aryl, substituted alkyl and substituted aryl groups. Exemplary R groups include but are not limited to —CH₂, —C(CH₃)₃, —OR′, —NR′R″, —SR′, -halogen, —C(O)R′, —CONR′R″, —S(O)₂R′, —S(O)₂NR′R″, —CN and —NO₂. R′, R″, R′″ and R″″ each independently refer to hydrogen, substituted or unsubstituted heteroalkyl, substituted or unsubstituted aryl, including but not limited to, aryl substituted with 1-3 halogens, substituted or unsubstituted alkyl, alkoxy or thioalkoxy groups, or arylalkyl groups. When a compound of the invention includes more than one R group, for example, each of the R groups is independently selected as are each R′, R″, R′″ and R″″ groups when more than one of these groups is present. When R′ and R″ are attached to the same nitrogen atom, they can be combined with the nitrogen atom to form a 5-, 6-, or 7-membered ring. For example, —NR′R″ is meant to include, but not be limited to, 1-pyrrolidinyl and 4-morpholinyl. From the above discussion of substituents, one of skill in the art will understand that the term “alkyl” is meant to include groups including carbon atoms bound to groups other than hydrogen groups, such as haloalkyl (including but not limited to, —CF₃ and —CH₂CF₃) and acyl (including but not limited to, —C(O)CH₃, —C(O)CF₃, —C(O)CH₂OCH₃, and the like).

The azide functional group can also be reacted selectively with a water soluble polymer containing a thioester and appropriately functionalized with an aryl phosphine moiety to generate an amide linkage. The aryl phosphine group reduces the azide in situ and the resulting amine then reacts efficiently with the thioester linkage to generate the corresponding amide. Exemplary water soluble polymers containing a thioester and a phosphine moiety can be represented as follows:

wherein n is 1-10; X can be O, N, S or not present, Ph is phenyl, and W is a water soluble polymer.

Exemplary alkyne-containing amino acids can be represented as follows:

wherein n is 0-10; R₁ is an alkyl, aryl, substituted alkyl, or substituted aryl or not present; X is O, N, S or not present; m is 0-10, R₂ is H, an amino acid, a polypeptide, or an amino terminus modification group, and R₃ is H, an amino acid, a polypeptide, or a carboxy terminus modification group. In some embodiments, n is 1, R₁ is phenyl, X is not present, m is 0 and the acetylene moiety is positioned in the para position relative to the alkyl side chain. In some embodiments, n is 1, R₁ is phenyl, X is O, m is 1 and the propargyloxy group is positioned in the para position relative to the alkyl side chain (i.e., O-propargyl-tyrosine). In some embodiments, n is 1, R₁ and X are not present and m is 0 (i.e., proparylglycine).

Alkyne-containing amino acids are commercially available. For example, propargylglycine is commercially available from Peptech (Burlington, Mass.). Alternatively, alkyne-containing amino acids can be prepared according to standard methods. For instance, p-propargyloxyphenylalanine can be synthesized, for example, as described in Deiters, A., et al., J. Am. Chem. Soc. 125: 11782-11783 (2003), and 4-alkynyl-L-phenylalanine can be synthesized as described in Kayser, B., et al., Tetrahedron 53(7): 2475-2484 (1997). Other alkyne-containing amino acids can be prepared by one skilled in the art.

Exemplary azide-containing amino acids can be represented as follows:

wherein n is 0-10; R₁ is an alkyl, aryl, substituted alkyl, substituted aryl or not present; X is O, N, S or not present; m is 0-10; R₂ is H, an amino acid, a polypeptide, or an amino terminus modification group, and R₃ is H, an amino acid, a polypeptide, or a carboxy terminus modification group. In some embodiments, n is 1, R₁ is phenyl, X is not present, m is 0 and the azide moiety is positioned para to the alkyl side chain. In some embodiments, n is 0-4 and R₁ and X are not present, and m=0. In some embodiments, n is 1, R₁ is phenyl, X is O, m is 2 and the P-azidoethoxy moiety is positioned in the para position relative to the alkyl side chain.

Azide-containing amino acids are available from commercial sources. For instance, 4-azidophenylalanine can be obtained from Chem-Impex International, Inc. (Wood Dale, Ill.). For those azide-containing amino acids that are not commercially available, the azide group can be prepared relatively readily using standard methods known to those of skill in the art, including but not limited to, via displacement of a suitable leaving group (including but not limited to, halide, mesylate, tosylate) or via opening of a suitably protected lactone. See, e.g., Advanced Organic Chemistry by March (Third Edition, 1985, Wiley and Sons, New York).

The unique reactivity of beta-substituted aminothiol functional groups makes them extremely useful for the selective modification of polypeptides and other biological molecules that contain aldehyde groups via formation of the thiazolidine. See, e.g., J. Shao and J. Tam, J. Am. Chem. Soc. 1995, 117 (14) 3893-3899. In some embodiments, beta-substituted aminothiol amino acids can be incorporated into antibodies and then reacted with water soluble polymers comprising an aldehyde functionality. In some embodiments, a water soluble polymer, drug conjugate or other payload can be coupled to an antibody polypeptide comprising a beta-substituted aminothiol amino acid via formation of the thiazolidine.

Particular examples of useful non-natural amino acids include, but are not limited to, p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, L-3-(2-naphthyl)alanine, 3-methyl-phenylalanine, O-4-allyl-L-tyrosine, 4-propyl-L-tyrosine, tri-O-acetyl-GlcNAc b-serine, L-Dopa, fluorinated phenylalanine, isopropyl-L-phenylalanine, p-azido-L-phenylalanine, p-acyl-L-phenylalanine, p-benzoyl-L-phenylalanine, L-phosphoserine, phosphonoserine, phosphonotyrosine, p-iodo-phenylalanine, p-bromophenylalanine, p-amino-L-phenylalanine, isopropyl-L-phenylalanine, and p-propargyloxy-phenylalanine. Further useful examples include N-acetyl-L-glucosaminyl-L-serine, N-acetyl-L-galactosaminyl-L-serine, N-acetyl-L-glucosaminyl-L-threonine, N-acetyl-L-glucosaminyl-L-asparagine and O-mannosaminyl-L-serine.

In particular embodiments, the non-natural amino acids are selected from p-acetyl-phenylalanine, p-ethynyl-phenylalanine, p-propargyloxyphenylalanine, and p-azido-phenylalanine. One particularly useful non-natural amino acid is p-azido phenylalanine. This amino acid residue is known to those of skill in the art to facilitate Huisgen [3+2] cyloaddition reactions (so-called “click” chemistry reactions) with, for example, compounds bearing alkynyl groups. This reaction enables one of skill in the art to readily and rapidly conjugate to the antibody at the site-specific location of the non-natural amino acid.

In certain embodiments, the first reactive group is an alkynyl moiety (including but not limited to, in the unnatural amino acid p-propargyloxyphenylalanine, where the propargyl group is also sometimes referred to as an acetylene moiety) and the second reactive group is an azido moiety, and [3+2] cycloaddition chemistry can be used. In certain embodiments, the first reactive group is the azido moiety (including but not limited to, in the unnatural amino acid p-azido-L-phenylalanine) and the second reactive group is the alkynyl moiety.

In the above formulas, each L represents a divalent linker. The divalent linker can be any divalent linker known to those of skill in the art. Generally, the divalent linker is capable of forming covalent bonds to the functional moiety R and the alpha carbon of the non-natural amino acid. Useful divalent linkers a bond, alkylene, substituted alkylene, heteroalkylene, substituted heteroalkylene, arylene, substituted arylene, heteroarlyene and substituted heteroarylene. In certain embodiments, L is C₁₋₁₀ alkylene or C₁₋₁₀ heteroalkylene.

The non-natural amino acids used in the methods and compositions described herein have at least one of the following four properties: (1) at least one functional group on the sidechain of the non-natural amino acid has at least one characteristics and/or activity and/or reactivity orthogonal to the chemical reactivity of the 20 common, genetically-encoded amino acids (i.e., alanine, arginine, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, and valine), or at least orthogonal to the chemical reactivity of the naturally occurring amino acids present in the polypeptide that includes the non-natural amino acid; (2) the introduced non-natural amino acids are substantially chemically inert toward the 20 common, genetically-encoded amino acids; (3) the non-natural amino acid can be stably incorporated into a polypeptide, preferably with the stability commensurate with the naturally-occurring amino acids or under typical physiological conditions, and further preferably such incorporation can occur via an in vivo system; and (4) the non-natural amino acid includes an oxime functional group or a functional group that can be transformed into an oxime group by reacting with a reagent, preferably under conditions that do not destroy the biological properties of the polypeptide that includes the non-natural amino acid (unless of course such a destruction of biological properties is the purpose of the modification/transformation), or where the transformation can occur under aqueous conditions at a pH between about 4 and about 8, or where the reactive site on the non-natural amino acid is an electrophilic site. Illustrative, non-limiting examples of amino acids that satisfy these four properties for non-natural amino acids that can be used with the compositions and methods described herein are presented in FIGS. 2, 3, 35 and 40-43. Any number of non-natural amino acids can be introduced into the polypeptide. Non-natural amino acids may also include protected or masked oximes or protected or masked groups that can be transformed into an oxime group after deprotection of the protected group or unmasking of the masked group. Non-natural amino acids may also include protected or masked carbonyl or dicarbonyl groups, which can be transformed into a carbonyl or dicarbonyl group after deprotection of the protected group or unmasking of the masked group and thereby are available to react with hydroxylamines or oximes to form oxime groups.

In further embodiments, non-natural amino acids that may be used in the methods and compositions described herein include, but are not limited to, amino acids comprising a photoactivatable cross-linker, spin-labeled amino acids, fluorescent amino acids, metal binding amino acids, metal-containing amino acids, radioactive amino acids, amino acids with novel functional groups, amino acids that covalently or non-covalently interact with other molecules, photocaged and/or photoisomerizable amino acids, amino acids comprising biotin or a biotin analogue, glycosylated amino acids such as a sugar substituted serine, other carbohydrate modified amino acids, keto-containing amino acids, aldehyde-containing amino acids, amino acids comprising polyethylene glycol or other polyethers, heavy atom substituted amino acids, chemically cleavable and/or photocleavable amino acids, amino acids with an elongated side chains as compared to natural amino acids, including but not limited to, polyethers or long chain hydrocarbons, including but not limited to, greater than about 5 or greater than about 10 carbons, carbon-linked sugar-containing amino acids, redox-active amino acids, amino thioacid containing amino acids, and amino acids comprising one or more toxic moiety.

In some embodiments, non-natural amino acids comprise a saccharide moiety. Examples of such amino acids include N-acetyl-L-glucosaminyl-L-serine, N-acetyl-L-galactosaminyl-L-serine, N-acetyl-L-glucosaminyl-L-threonine, N-acetyl-L-glucosaminyl-L-asparagine and O-mannosaminyl-L-serine. Examples of such amino acids also include examples where the naturally-occurring N- or O-linkage between the amino acid and the saccharide is replaced by a covalent linkage not commonly found in nature-including but not limited to, an alkene, an oxime, a thioether, an amide and the like. Examples of such amino acids also include saccharides that are not commonly found in naturally-occurring proteins such as 2-deoxy-glucose, 2-deoxygalactose and the like.

The chemical moieties incorporated into antibodies via incorporation of non-natural amino acids offer a variety of advantages and manipulations of polypeptides. For example, the unique reactivity of a carbonyl or dicarbonyl functional group (including a keto- or aldehyde-functional group) allows selective modification of antibodies with any of a number of hydrazine- or hydroxylamine-containing reagents in vivo and in vitro. A heavy atom non-natural amino acid, for example, can be useful for phasing x-ray structure data. The site-specific introduction of heavy atoms using non-natural amino acids also provides selectivity and flexibility in choosing positions for heavy atoms. Photoreactive non-natural amino acids (including but not limited to, amino acids with benzophenone and arylazides (including but not limited to, phenylazide) side chains), for example, allow for efficient in vivo and in vitro photocrosslinking of polypeptides. Examples of photoreactive non-natural amino acids include, but are not limited to, p-azido-phenylalanine and p-benzoyl-phenylalanine. The antibodies with the photoreactive non-natural amino acids may then be crosslinked at will by excitation of the photoreactive group-providing temporal control. In a non-limiting example, the methyl group of a non-natural amino can be substituted with an isotopically labeled, including but not limited to, with a methyl group, as a probe of local structure and dynamics, including but not limited to, with the use of nuclear magnetic resonance and vibrational spectroscopy.

Amino acids with an electrophilic reactive group allow for a variety of reactions to link molecules via various chemical reactions, including, but not limited to, nucleophilic addition reactions. Such electrophilic reactive groups include a carbonyl- or dicarbonyl-group (including a keto- or aldehyde group), a carbonyl-like- or dicarbonyl-like-group (which has reactivity similar to a carbonyl- or dicarbonyl-group and is structurally similar to a carbonyl- or dicarbonyl-group), a masked carbonyl- or masked dicarbonyl-group (which can be readily converted into a carbonyl- or dicarbonyl-group), or a protected carbonyl- or protected dicarbonyl-group (which has reactivity similar to a carbonyl- or dicarbonyl-group upon deprotection). Such amino acids include amino acids having the structure of Formula (I):

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; J is

R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; each R″ is independently H, alkyl, substituted alkyl, or a protecting group, or when more than one R″ group is present, two R″ optionally form a heterocycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ groups optionally form a cycloalkyl or a heterocycloalkyl; or the -A-B-J-R groups together form a bicyclic or tricyclic cycloalkyl or heterocycloalkyl comprising at least one carbonyl group, including a dicarbonyl group, protected carbonyl group, including a protected dicarbonyl group, or masked carbonyl group, including a masked dicarbonyl group; or the -J-R group together forms a monocyclic or bicyclic cycloalkyl or heterocycloalkyl comprising at least one carbonyl group, including a dicarbonyl group, protected carbonyl group, including a protected dicarbonyl group, or masked carbonyl group, including a masked dicarbonyl group; with a proviso that when A is phenylene and each R₃ is H, B is present; and that when A is —(CH₂)₄— and each R₃ is H, B is not —NHC(O)(CH₂CH₂)—; and that when A and B are absent and each R₃ is H, R is not methyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In certain embodiments, compounds of Formula (I) are stable in aqueous solution for at least 1 month under mildly acidic conditions. In certain embodiments, compounds of Formula (I) are stable for at least 2 weeks under mildly acidic conditions. In certain embodiments, compound of Formula (I) are stable for at least 5 days under mildly acidic conditions. In certain embodiments, such acidic conditions are pH 2 to 8.

In certain embodiments of compounds of Formula (I), B is lower alkylene, substituted lower alkylene, —O-(alkylene or substituted alkylene)-, —C(R′)═N—N(R′)—, —N(R′)CO—, —C(O)—, —C(R′)═N—, —C(O)-(alkylene or substituted alkylene)-, —CON(R′)-(alkylene or substituted alkylene)-, —S(alkylene or substituted alkylene)-, —S(O)(alkylene or substituted alkylene)-, or —S(O)₂(alkylene or substituted alkylene)-. In certain embodiments of compounds of Formula (I), B is —O(CH₂)—, —CH═N—, —CH═N—NH—, —NHCH₂—, —NHCO—, —C(O)—, —C(O)—(CH₂)—, —CONH—(CH₂)—, —SCH₂—, —S(═O)CH₂—, or —S(O)₂CH₂—. In certain embodiments of compounds of Formula (I), R is C₁₋₆ alkyl or cycloalkyl. In certain embodiments of compounds of Formula (I) R is —CH₃, —CH(CH₃)₂, or cyclopropyl. In certain embodiments of compounds of Formula (I), R₁ is H, tert-butyloxycarbonyl (Boc), 9-Fluorenylmethoxycarbonyl (Fmoc), N-acetyl, tetrafluoroacetyl (TFA), or benzyloxycarbonyl (Cbz). In certain embodiments of compounds of Formula (I), R₁ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (I), R₂ is OH, O-methyl, O-ethyl, or O-t-butyl. In certain embodiments of compounds of Formula (I), R₂ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (I), R₂ is a polynucleotide. In certain embodiments of compounds of Formula (I), R₂ is ribonucleic acid (RNA). In certain embodiments of compounds of Formula (I), R₂ is tRNA. In certain embodiments of compounds of Formula (I), the tRNA specifically recognizes a selector codon. In certain embodiments of compounds of Formula (I) the selector codon is selected from the group consisting of an amber codon, ochre codon, opal codon, a unique codon, a rare codon, an unnatural codon, a five-base codon, and a four-base codon. In certain embodiments of compounds of Formula (I), R₂ is a suppressor tRNA.

In certain embodiments of compounds of Formula (I),

is selected from the group consisting of: (i) A is substituted lower alkylene, C₄-arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a divalent linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S(O)—, —S(O)₂—, —NS(O)₂—, —OS(O)₂—, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —N(R′)—, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —N(R′)C(S)—, —S(O)N(R′), —S(O)₂N(R′), —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)N(R′)—, —N(R′)S(O)₂N(R′)—, —N(R′)—N═, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—; (ii) A is optional, and when present is substituted lower alkylene, C₄-arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is a divalent linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S(O)—, —S(O)₂—, —NS(O)₂—, —OS(O)₂—, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —N(R′)—, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —N(R′)C(S)—, —S(O)N(R′), —S(O)₂N(R′), —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)N(R′)—, —N(R′)S(O)₂N(R′)—, —N(R′)—N═, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—; (iii) A is lower alkylene; B is optional, and when present is a divalent linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S(O)—, —S(O)₂—, —NS(O)₂—, —OS(O)₂—, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —N(R′)—, —C(O)N(R′)—, —CSN(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —N(R′)C(S)—, —S(O)N(R′), —S(O)₂N(R′), —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)N(R′)—, —N(R′)S(O)₂N(R′)—, —N(R′)—N═, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—; and (iv) A is phenylene; B is a divalent linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S(O)—, —S(O)₂—, —NS(O)₂—, —OS(O)₂—, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —N(R′)—, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —N(R′)C(S)—, —S(O)N(R′), —S(O)₂N(R′), —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)N(R′)—, —N(R′)S(O)₂N(R′)—, —N(R′)—N═, —C(R′)′N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—; J is

each R′ is independently H, alkyl, or substituted alkyl; R₁ is optional, and when present, is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is optional, and when present, is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; and each R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl; and R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl.

In addition, amino acids having the structure of Formula (II) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide. In certain embodiments, when A is phenylene, B is present; and that when A is —(CH₂)₄—, B is not —NHC(O)(CH₂CH₂)—; and that when A and B are absent, R is not methyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, amino acids having the structure of Formula (III) are included:

wherein: B is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

Such non-natural amino acids may be are optionally amino protected group, carboxyl protected and/or in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (IV) are included:

wherein —NS(O)₂—, —OS(O)₂—, optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl; and n is 0 to 8. In certain embodiments, when A is —(CH₂)₄—, B is not —NHC(O)(CH₂CH₂)—. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

wherein such compounds are optionally amino protected, optionally carboxyl protected, optionally amino protected and carboxyl protected, or a salt thereof, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (VIII) are included:

wherein, A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (IX) are included:

wherein, B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; wherein each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

wherein such compounds are optionally amino protected, optionally carboxyl protected, optionally amino protected and carboxyl protected, or a salt thereof, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (X) are included:

wherein, B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl; and n is 0 to 8. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

wherein such compounds are optionally amino protected, optionally carboxyl protected, optionally amino protected and carboxyl protected, or a salt thereof, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition to monocarbonyl structures, the non-natural amino acids described herein may include groups such as dicarbonyl, dicarbonyl like, masked dicarbonyl and protected dicarbonyl groups. For example, the following amino acids having the structure of Formula (V) are included:

wherein, A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (VI) are included:

wherein, B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; wherein each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

wherein such compounds are optionally amino protected and carboxyl protected, or a salt thereof. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (VII) are included:

wherein, B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl; and n is 0 to 8. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids are included:

wherein such compounds are optionally amino protected and carboxyl protected, or a salt thereof, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXX) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; X₁ is C, S, or S(O); and L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXX-A) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXX-B) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXXI) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; X₁ is C, S, or S(O); and n is 0, 1, 2, 3, 4, or 5; and each R⁸ and R⁹ on each CR⁸R⁹ group is independently selected from the group consisting of H, alkoxy, alkylamine, halogen, alkyl, aryl, or any R⁸ and R⁹ can together form ═O or a cycloalkyl, or any to adjacent R⁸ groups can together form a cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXXI-A) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; n is 0, 1, 2, 3, 4, or 5; and each R⁸ and R⁹ on each CR⁸R⁹ group is independently selected from the group consisting of H, alkoxy, alkylamine, halogen, alkyl, aryl, or any R⁸ and R⁹ can together form ═O or a cycloalkyl, or any to adjacent R⁸ groups can together form a cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXXI-B) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; n is 0, 1, 2, 3, 4, or 5; and each R⁸ and R⁹ on each CR⁸R⁹ group is independently selected from the group consisting of H, alkoxy, alkylamine, halogen, alkyl, aryl, or any R⁸ and R⁹ can together form ═O or a cycloalkyl, or any to adjacent R⁸ groups can together form a cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXXII) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; X₁ is C, S, or S(O); and L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

The In addition, the following amino acids having the structure of Formula (XXXII-A) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, the following amino acids having the structure of Formula (XXXII-B) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; L is alkylene, substituted alkylene, N(R′)(alkylene) or N(R′)(substituted alkylene), where R′ is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, amino acids having the structure of Formula (XXXX) are included:

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene;

where (a) indicates bonding to the A group and (b) indicates bonding to respective carbonyl groups, R₃ and R₄ are independently chosen from H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl, or R₃ and R₄ or two R₃ groups or two R₄ groups optionally form a cycloalkyl or a heterocycloalkyl; R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; T₃ is a bond, C(R)(R), O, or S, and R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, amino acids having the structure of Formula (XXXXI) are included:

wherein:

where (a) indicates bonding to the A group and (b) indicates bonding to respective carbonyl groups, R₃ and R₄ are independently chosen from H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl, or R₃ and R₄ or two R₃ groups or two R₄ groups optionally form a cycloalkyl or a heterocycloalkyl; R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; T₃ is a bond, C(R)(R), O, or S, and R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, amino acids having the structure of Formula (XXXXII) are included:

wherein: R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; and T₃ is O, or S. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, amino acids having the structure of Formula (XXXXIII) are included:

wherein: R is H, halogen, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl.

In addition, the following amino acids having structures of Formula (XXXXIII) are included:

Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Non-natural amino acids containing a hydroxylamine (also called an aminooxy) group allow for reaction with a variety of electrophilic groups to form conjugates (including but not limited to, with PEG or other water soluble polymers). Like hydrazines, hydrazides and semicarbazides, the enhanced nucleophilicity of the aminooxy group permits it to react efficiently and selectively with a variety of molecules that contain carbonyl- or dicarbonyl-groups, including but not limited to, ketones, aldehydes or other functional groups with similar chemical reactivity. See, e.g., Shao, J. and Tam, J., J. Am. Chem. Soc. 117:3893-3899 (1995); H. Hang and C. Bertozzi, Acc. Chem. Res. 34(9): 727-736 (2001). Whereas the result of reaction with a hydrazine group is the corresponding hydrazone, however, an oxime results generally from the reaction of an aminooxy group with a carbonyl- or dicarbonyl-containing group such as, by way of example, a ketones, aldehydes or other functional groups with similar chemical reactivity.

Thus, in certain embodiments described herein are non-natural amino acids with sidechains comprising a hydroxylamine group, a hydroxylamine-like group (which has reactivity similar to a hydroxylamine group and is structurally similar to a hydroxylamine group), a masked hydroxylamine group (which can be readily converted into a hydroxylamine group), or a protected hydroxylamine group (which has reactivity similar to a hydroxylamine group upon deprotection). Such amino acids include amino acids having the structure of Formula (XIV):

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; K is —NR₆R₇ or —N═CR₆R₇; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ groups optionally form a cycloalkyl or a heterocycloalkyl; each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl, —C(O)R″, —C(O)₂R″, —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₆ or R₇ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In certain embodiments of compounds of Formula (XIV), A is phenylene or substituted phenylene. In certain embodiments of compounds of Formula (XIV), B is -(alkylene or substituted alkylene)-, —O-(alkylene or substituted alkylene)-, —S-(alkylene or substituted alkylene)-, or —C(O)-(alkylene or substituted alkylene)-. In certain embodiments of compounds of Formula (XIV), B is —O(CH₂)₂—, —S(CH₂)₂—, —NH(CH₂)₂—, —CO(CH₂)₂—, or —(CH₂)_(n)— where n is 1 to 4. In certain embodiments of compounds of Formula (XIV), R₁ is H, tert-butyloxycarbonyl (Boc), 9-Fluorenylmethoxycarbonyl (Fmoc), N-acetyl, tetrafluoroacetyl (TFA), or benzyloxycarbonyl (Cbz). In certain embodiments of compounds of Formula (XIV), R₁ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (XIV), wherein R₂ is OH, O-methyl, O-ethyl, or O-t-butyl. In certain embodiments of compounds of Formula (XIV), R₂ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (XIV), R₂ is a polynucleotide. In certain embodiments of compounds of Formula (XIV), R₂ is ribonucleic acid (RNA). In certain embodiments of compounds of Formula (XIV), R₂ is tRNA. In certain embodiments of compounds of Formula (XIV), the tRNA specifically recognizes a codon selected from the group consisting of an amber codon, ochre codon, opal codon, a unique codon, a rare codon, an unnatural codon, a five-base codon, and a four-base codon. In certain embodiments of compounds of Formula (XIV), R₂ is a suppressor tRNA. In certain embodiments of compounds of Formula (XIV), each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl. In certain embodiments of compounds of Formula (XIV), each of R₆ and R₇ is independently selected from the group consisting of H, methyl, phenyl, and -[(alkylene or substituted alkylene)-O-(hydrogen, alkyl, or substituted alkyl)]_(x), wherein x is from 1-50. In certain embodiments of compounds of Formula (XIV), K is —NR₆R₇.

In certain embodiments of compounds of Formula (XIV), X is a biologically active agent selected from the group consisting of a peptide, protein, enzyme, antibody, drug, dye, lipid, nucleosides, oligonucleotide, cell, virus, liposome, microparticle, and micelle. In certain embodiments of compounds of Formula (XIV), X is a drug selected from the group consisting of an antibiotic, fungicide, anti-viral agent, anti-inflammatory agent, anti-tumor agent, cardiovascular agent, anti-anxiety agent, hormone, growth factor, and steroidal agent. In certain embodiments of compounds of Formula (XIV), X is an enzyme selected from the group consisting of horseradish peroxidase, alkaline phosphatase, β-galactosidase, and glucose oxidase. In certain embodiments of compounds of Formula (XIV), X is a detectable label selected from the group consisting of a fluorescent, phosphorescent, chemiluminescent, chelating, electron dense, magnetic, intercalating, radioactive, chromophoric, and energy transfer moiety.

In certain embodiments, compounds of Formula (XIV) are stable in aqueous solution for at least 1 month under mildly acidic conditions. In certain embodiments, compounds of Formula (XIV) are stable for at least 2 weeks under mildly acidic conditions. In certain embodiments, compound of Formula (XIV) are stable for at least 5 days under mildly acidic conditions. In certain embodiments, such acidic conditions are pH 2 to 8.

Such amino acids include amino acids having the structure of Formula (XV):

wherein A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ groups optionally form a cycloalkyl or a heterocycloalkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

A non-limiting, representative amino acid has the following structure:

Such a non-natural amino acid may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Non-natural amino acids containing an oxime group allow for reaction with a variety of reagents that contain certain reactive carbonyl- or dicarbonyl-groups (including but not limited to, ketones, aldehydes, or other groups with similar reactivity) to form new non-natural amino acids comprising a new oxime group. Such an oxime exchange reaction allow for the further functionalization of non-natural amino acid polypeptides. Further, the original non-natural amino acids containing an oxime group may be useful in their own right as long as the oxime linkage is stable under conditions necessary to incorporate the amino acid into a polypeptide (e.g., the in vivo, in vitro and chemical synthetic methods described herein).

Thus, in certain embodiments described herein are non-natural amino acids with sidechains comprising an oxime group, an oxime-like group (which has reactivity similar to an oxime group and is structurally similar to an oxime group), a masked oxime group (which can be readily converted into an oxime group), or a protected oxime group (which has reactivity similar to an oxime group upon deprotection). Such amino acids include amino acids having the structure of Formula (XI):

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ optionally form a cycloalkyl or a heterocycloalkyl; R₅ is H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, alkylalkoxy, substituted alkylalkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, substituted aralkyl, -(alkylene or substituted alkylene)-ON(R″)₂, -(alkylene or substituted alkylene)-C(O)SR″, -(alkylene or substituted alkylene)-S—S-(aryl or substituted aryl), —C(O)R″, —C(O)₂R″, or —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₅ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, -(alkylene or substituted alkylene)-O—N═CR′—, -(alkylene or substituted alkylene)-C(O)NR′-(alkylene or substituted alkylene)-, -(alkylene or substituted alkylene)-S(O)_(k)-(alkylene or substituted alkylene)-S—, -(alkylene or substituted alkylene)-S—S—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; with a proviso that when A and B are absent, R is not methyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In certain embodiments of compounds of Formula (XI), B is —O-(alkylene or substituted alkylene)-. In certain embodiments of compounds of Formula (XI), B is —O(CH₂)—. In certain embodiments of compounds of Formula (XI), R is C₁₋₄ alkyl. In certain embodiments of compounds of Formula (XI), R is —CH₃. In certain embodiments of compounds of Formula (XI), R₁ is H, tert-butyloxycarbonyl (Boc), 9-fluorenylmethoxycarbonyl (Fmoc), N-acetyl, tetrafluoroacetyl (TFA), or benzyloxycarbonyl (Cbz). In certain embodiments of compounds of Formula (XI), R₁ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (XI), R₂ is OH, O-methyl, O-ethyl, or O-t-butyl. In certain embodiments of compounds of Formula (XI), R₂ is a resin, amino acid, polypeptide, or polynucleotide. In certain embodiments of compounds of Formula (XI), R₂ is a polynucleotide. In certain embodiments of compounds of Formula (XI), R₂ is ribonucleic acid (RNA). In certain embodiments of compounds of Formula (XI), R₂ is tRNA. In certain embodiments of compounds of Formula (XI), the tRNA specifically recognizes a selector codon. In certain embodiments of compounds of Formula (XI), the selector codon is selected from the group consisting of an amber codon, ochre codon, opal codon, a unique codon, a rare codon, an unnatural codon, a five-base codon, and a four-base codon. In certain embodiments of compounds of Formula (XI), R₂ is a suppressor tRNA. In certain embodiments of compounds of Formula (XI), R₅ is alkylalkoxy, substituted alkylalkoxy, polyalkylene oxide, substituted polyalkylene oxide, or —C(O)₂R″. In certain embodiments of compounds of Formula (XI), R₅ is -[(alkylene or substituted alkylene)-O-(hydrogen, alkyl, or substituted alkyl)]_(x), wherein x is from 1-50. In certain embodiments of compounds of Formula (XI), R₅ is —(CH₂CH₂)—O—CH₃ or —COOH.

In certain embodiments, compounds of Formula (XI) are stable in aqueous solution for at least 1 month under mildly acidic conditions. In certain embodiments, compounds of Formula (XI) are stable for at least 2 weeks under mildly acidic conditions. In certain embodiments, compound of Formula (XI) are stable for at least 5 days under mildly acidic conditions. In certain embodiments, such acidic conditions are pH 2 to 8.

Amino acids of Formula (XI) include amino acids having the structure of Formula (XII):

wherein, B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; R₅ is H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, alkylalkoxy, substituted alkylalkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, substituted aralkyl, -(alkylene or substituted alkylene)-ON(R″)₂, -(alkylene or substituted alkylene)-C(O)SR″, -(alkylene or substituted alkylene)-S—S-(aryl or substituted aryl), —C(O)R″, —C(O)₂R″, or —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₅ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, -(alkylene or substituted alkylene)-O—N═CR′—, -(alkylene or substituted alkylene)-C(O)NR′-(alkylene or substituted alkylene)-, -(alkylene or substituted alkylene)-S(O)_(k)-(alkylene or substituted alkylene)-S—, -(alkylene or substituted alkylene)-S—S—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Such amino acids include amino acids having the structure of Formula (XIII):

wherein, R is H, alkyl, substituted alkyl, cycloalkyl, or substituted cycloalkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; R₅ is H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, alkylalkoxy, substituted alkylalkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, substituted aralkyl, -(alkylene or substituted alkylene)-ON(R″)₂, -(alkylene or substituted alkylene)-C(O)SR″, -(alkylene or substituted alkylene)-S—S-(aryl or substituted aryl), —C(O)R″, —C(O)₂R″, or —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₅ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, -(alkylene or substituted alkylene)-O—N═CR′—, -(alkylene or substituted alkylene)-C(O)NR′-(alkylene or substituted alkylene)-, -(alkylene or substituted alkylene)-S(O)_(k)-(alkylene or substituted alkylene)-S—, -(alkylene or substituted alkylene)-S—S—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Further non-limiting examples of such amino acids include amino acids having the following structures:

Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In addition, such amino acids include amino acids having the structure of Formula (XIV):

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; K is —NR₆R₇ or —N═CR₆R₇; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ groups optionally form a cycloalkyl or a heterocycloalkyl; each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl, —C(O)R″, —C(O)₂R″, —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₆ or R₇ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Such amino acids further include amino acids having the structure of Formula (XVI):

wherein: A is optional, and when present is lower alkylene, substituted lower alkylene, lower cycloalkylene, substituted lower cycloalkylene, lower alkenylene, substituted lower alkenylene, alkynylene, lower heteroalkylene, substituted heteroalkylene, lower heterocycloalkylene, substituted lower heterocycloalkylene, arylene, substituted arylene, heteroarylene, substituted heteroarylene, alkarylene, substituted alkarylene, aralkylene, or substituted aralkylene; B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₃ and R₄ is independently H, halogen, lower alkyl, or substituted lower alkyl, or R₃ and R₄ or two R₃ optionally form a cycloalkyl or a heterocycloalkyl; each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl, —C(O)R″, —C(O)₂R″, —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₆ or R₇ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Further, such amino acids include amino acids having the structure of Formula (XVII):

wherein: B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl, —C(O)R″, —C(O)₂R″, —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₆ or R₇ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl.

Non-limiting examples of such amino acids include amino acids having the following structures:

Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Additionally, such amino acids include amino acids having the structure of Formula (XVIII):

wherein: B is optional, and when present is a linker selected from the group consisting of lower alkylene, substituted lower alkylene, lower alkenylene, substituted lower alkenylene, lower heteroalkylene, substituted lower heteroalkylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —NS(O)₂—, —OS(O)₂—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; R₁ is H, an amino protecting group, resin, amino acid, polypeptide, or polynucleotide; and R₂ is OH, an ester protecting group, resin, amino acid, polypeptide, or polynucleotide; each of R₆ and R₇ is independently selected from the group consisting of H, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, polyalkylene oxide, substituted polyalkylene oxide, aryl, substituted aryl, heteroaryl, substituted heteroaryl, alkaryl, substituted alkaryl, aralkyl, and substituted aralkyl, —C(O)R″, —C(O)₂R″, —C(O)N(R″)₂, wherein each R″ is independently hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkoxy, substituted alkoxy, aryl, substituted aryl, heteroaryl, alkaryl, substituted alkaryl, aralkyl, or substituted aralkyl; or R₆ or R₇ is L-X, where X is a selected from the group consisting of a label; a dye; a polymer; a water-soluble polymer; a derivative of polyethylene glycol; a photocrosslinker; a cytotoxic compound; a drug; an affinity label; a photoaffinity label; a reactive compound; a resin; a second protein or polypeptide or polypeptide analog; an antibody or antibody fragment; a metal chelator; a cofactor; a fatty acid; a carbohydrate; a polynucleotide; a DNA; a RNA; an antisense polynucleotide; a saccharide, a water-soluble dendrimer, a cyclodextrin, a biomaterial; a nanoparticle; a spin label; a fluorophore, a metal-containing moiety; a radioactive moiety; a novel functional group; a group that covalently or noncovalently interacts with other molecules; a photocaged moiety; a photoisomerizable moiety; biotin; a biotin analogue; a moiety incorporating a heavy atom; a chemically cleavable group; a photocleavable group; an elongated side chain; a carbon-linked sugar; a redox-active agent; an amino thioacid; a toxic moiety; an isotopically labeled moiety; a biophysical probe; a phosphorescent group; a chemiluminescent group; an electron dense group; a magnetic group; an intercalating group; a chromophore; an energy transfer agent; a biologically active agent; a detectable label; and any combination thereof; and L is optional, and when present is a linker selected from the group consisting of alkylene, substituted alkylene, alkenylene, substituted alkenylene, —O—, —O-(alkylene or substituted alkylene)-, —S—, —S-(alkylene or substituted alkylene)-, —S(O)_(k)— where k is 1, 2, or 3, —S(O)_(k)(alkylene or substituted alkylene)-, —C(O)—, —C(O)-(alkylene or substituted alkylene)-, —C(S)—, —C(S)-(alkylene or substituted alkylene)-, —N(R′)—, —NR′-(alkylene or substituted alkylene)-, —C(O)N(R′)—, —CON(R′)-(alkylene or substituted alkylene)-, —CSN(R′)—, —CSN(R′)-(alkylene or substituted alkylene)-, —N(R′)CO-(alkylene or substituted alkylene)-, —N(R′)C(O)O—, —S(O)_(k)N(R′)—, —N(R′)C(O)N(R′)—, —N(R′)C(S)N(R′)—, —N(R′)S(O)_(k)N(R′)—, —N(R′)—N═, —C(R′)═N—, —C(R′)═N—N(R′)—, —C(R′)═N—N═, —C(R′)₂—N═N—, and —C(R′)₂—N(R′)—N(R′)—, where each R′ is independently H, alkyl, or substituted alkyl; and each R_(a) is independently selected from the group consisting of H, halogen, alkyl, substituted alkyl, —N(R′)₂, —C(O)_(k)R′ where k is 1, 2, or 3, —C(O)N(R′)₂, —OR′, and —S(O)_(k)R′; where each R′ is independently H, alkyl, or substituted alkyl and n is 0 to 8. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Non-limiting examples of such amino acids include amino acids having the following structures:

Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

In certain embodiments, the non-natural amino acid can be according to formula XIX:

or a salt thereof, wherein: D is —Ar—W₃— or —W₁—Y₁—C(O)—Y₂—W₂—; Ar is

each of W₁, W₂, and W₃ is independently a single bond or lower alkylene; each X₁ is independently —NH—, —O—, or —S—; each Y₁ is independently a single bond, —NH—, or —O—; each Y₂ is independently a single bond, —NH—, —O—, or an N-linked or C-linked pyrrolidinylene; and one of Z₁, Z₂, and Z₃ is —N— and the others of Z₁, Z₂, and Z₃ are independently —CH—. In certain embodiments, the non-natural amino acid is according to formula XIXa:

where D is a defined in the context of formula XIX. In certain embodiments, the non-natural amino acid is according formula XIXb:

or a salt thereof, wherein W₄ is C₁-C₁₀ alkylene. In a further embodiment, W₄ is C₁-C₅ alkylene. In an embodiment, W₄ is C₁-C₃ alkylene. In an embodiment, W₄ is C₁ alkylene. In particular embodiments, the non-natural amino acid is selected from the group consisting of:

or a salt thereof. Such non-natural amino acids may be in the form of a salt, or may be incorporated into a non-natural amino acid polypeptide, polymer, polysaccharide, or a polynucleotide and optionally post translationally modified.

Linkers and Payloads

In certain embodiments, the modified Fc protein comprises a non-natural amino acid having a reactive group, as described above. One of skill in the art can use the reactive group to link the modified Fc protein to any molecular entity capable of forming a covalent bond to the non-natural amino acid, directly or indirectly via a linker. Accordingly, provided herein are conjugates of a modified Fc protein (also referred to as an Fc protein conjugate), as described herein, linked to one or more payload moieties via one or more non-natural amino acids at site-specific positions. The payload can be linked directly or indirectly to the modified Fc protein, for instance, via a linker.

Any known methods for attaching therapeutic and/or diagnostic molecules to an Fc domain or fragment can be used to attach a payload or conjugate to a modified Fc protein provided herein. For example, conventional approaches for chemical conjugation to the immunoglobulin Fc domain include random coupling to naturally occurring primary amines such as lysine and the amino-terminus or carboxylic acids such as glutamic acid, aspartic acid and the carboxy terminus. Alternatively, semi-selective site-specific coupling may be achieved through N-terminal conjugation under appropriate conditions, or derivatized carbohydrates as found on Fc proteins isolated from eukaryotic sources, or by partial reduction and coupling of native cysteine residues. (E.g., Kim et al., A pharmaceutical composition comprising an immunoglobulin Fc region as a carrier, WO 2005/047337). Additional information can be found in, for example, U.S. Pat. No. 8,008,453 to Gegg et al., entitled “Modified Fc molecules;” U.S. Pat. No. 7,887,809 to Garen et al., entitled “Neovascular-targeted immunoconjugates;” U.S. Pat. No. 8,124,094 to Kim et al., entitled “Immunoglobulin Fc fragment modified by non-peptide polymer and pharmaceutical composition comprising the same;” Carter, 2011, Experimental Cell Research, 317(9):1261-1269; Santi et al., 2012, PNAS 109(16): 6211-6216; and Reichert 2011, MAbs. 3(1): 76-99; each of which is hereby incorporated by reference herein in its entirety.

Useful linkers include those described in the section above. In certain embodiments, the linker is any divalent or multivalent linker known to those of skill in the art. Generally, the linker is capable of forming covalent bonds to the functional moiety R and the alpha carbon of the non-natural amino acid. Useful divalent linkers include a bond, alkylene, substituted alkylene, heteroalkylene, substituted heteroalkylene, arylene, substituted arylene, heteroarlyene and substituted heteroarylene. In certain embodiments, the linker is C₁₋₁₀ alkylene or C₁₋₁₀ heteroalkylene.

The molecular payload can be any molecular entity that one of skill in the art might desire to conjugate to the Fc protein. In certain embodiments, the payload is a therapeutic moiety. In such embodiments, the Fc protein conjugate can be used to target the therapeutic moiety to its molecular target. In certain embodiments, the payload is a labeling moiety. In such embodiments, the Fc protein conjugate can be used to detect binding of the Fc protein to its target. In certain embodiments, the payload is a cytotoxic moiety. In such embodiments, the conjugate can be used target the cytotoxic moiety to a diseased cell, for example, a cancer cell, to initiate destruction or elimination of the cell. Conjugates comprising other molecular payloads apparent to those of skill in the art are within the scope of the conjugates described herein.

In certain embodiments, a conjugate can have a payload selected from the group consisting of a label, a dye, a polymer, a water-soluble polymer, polyethylene glycol, a derivative of polyethylene glycol, a photocrosslinker, a cytotoxic compound, a radionuclide, a drug, an affinity label, a photoaffinity label, a reactive compound, a resin, a second protein or polypeptide or polypeptide analog, an antibody or antibody fragment, a metal chelator, a cofactor, a fatty acid, a carbohydrate, a polynucleotide, a DNA, a RNA, an antisense polynucleotide, a peptide, a water-soluble dendrimer, a cyclodextrin, an inhibitory ribonucleic acid, a biomaterial, a nanoparticle, a spin label, a fluorophore, a metal-containing moiety, a radioactive moiety, a novel functional group, a group that covalently or noncovalently interacts with other molecules, a photocaged moiety, a photoisomerizable moiety, biotin, a derivative of biotin, a biotin analogue, a moiety incorporating a heavy atom, a chemically cleavable group, a photocleavable group, an elongated side chain, a carbon-linked sugar, a redox-active agent, an amino thioacid, a toxic moiety, an isotopically labeled moiety, a biophysical probe, a phosphorescent group, a chemiluminescent group, an electron dense group, a magnetic group, an intercalating group, a chromophore, an energy transfer agent, a biologically active agent, a detectable label, a small molecule, or any combination thereof.

Useful drug paylods include any cytotoxic, cytostatic or immunomodulatory drug. Useful classes of cytotoxic or immunomodulatory agents include, for example, antitubulin agents, auristatins, DNA minor groove binders, DNA replication inhibitors, alkylating agents (e.g., platinum complexes such as cis-platin, mono(platinum), bis(platinum) and tri-nuclear platinum complexes and carboplatin), anthracyclines, antibiotics, antifolates, antimetabolites, calmodulin inhibitors, chemotherapy sensitizers, duocarmycins, etoposides, fluorinated pyrimidines, ionophores, lexitropsins, maytansinoids, nitrosoureas, platinols, pore-forming compounds, purine antimetabolites, puromycins, radiation sensitizers, rapamycins, steroids, taxanes, topoisomerase inhibitors, vinca alkaloids, or the like.

Individual cytotoxic or immunomodulatory agents include, for example, an androgen, anthramycin (AMC), asparaginase, 5-azacytidine, azathioprine, bleomycin, busulfan, buthionine sulfoximine, calicheamicin, calicheamicin derivatives, camptothecin, carboplatin, carmustine (BSNU), CC-1065, chlorambucil, cisplatin, colchicine, cyclophosphamide, cytarabine, cytidine arabinoside, cytochalasin B, dacarbazine, dactinomycin (formerly actinomycin), daunorubicin, decarbazine, DM1, DM4, docetaxel, doxorubicin, etoposide, an estrogen, 5-fluordeoxyuridine, 5-fluorouracil, gemcitabine, gramicidin D, hydroxyurea, idarubicin, ifosfamide, irinotecan, lomustine (CCNU), maytansine, mechlorethamine, melphalan, 6-mercaptopurine, methotrexate, mithramycin, mitomycin C, mitoxantrone, nitroimidazole, paclitaxel, palytoxin, plicamycin, procarbizine, rhizoxin, streptozotocin, tenoposide, 6-thioguanine, thioTEPA, topotecan, vinblastine, vincristine, vinorelbine, VP-16 and VM-26.

In some embodiments, suitable cytotoxic agents include, for example, DNA minor groove binders (e.g., enediynes and lexitropsins, a CBI compound; see also U.S. Pat. No. 6,130,237), duocarmycins, taxanes (e.g., paclitaxel and docetaxel), puromycins, vinca alkaloids, CC-1065, SN-38, topotecan, morpholino-doxorubicin, rhizoxin, cyanomorpholino-doxorubicin, echinomycin, combretastatin, netropsin, epothilone A and B, estramustine, cryptophycins, cemadotin, maytansinoids, discodermolide, eleutherobin, and mitoxantrone.

In some embodiments, the payload is one or more therapeutic peptides. Any peptide that exhibits a therapeutic effect can be included as a payload to a modified Fc protein. The therapeutic peptides can be of any length; for example, a therapeutic peptide can have two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, six or more amino acids, seven or more amino acids, eight or more amino acids, nine or more amino acids, 10 or more amino acids, 11 or more amino acids, 12 or more amino acids, 13 or more amino acids, 14 or more amino acids, 15 or more amino acids, 16 or more amino acids, 17 or more amino acids, 18 or more amino acids, 19 or more amino acids, 20 or more amino acids, 22 or more amino acids, 24 or more amino acids, 26 or more amino acids, 28 or more amino acids, 30 or more amino acids, 32 or more amino acids, 34 or more amino acids, 36 or more amino acids, 38 or more amino acids, 40 or more amino acids, 42 or more amino acids, 44 or more amino acids, 46 or more amino acids, 48 or more amino acids, 50 or more amino acids, 55 or more amino acids, 60 or more amino acids, 65 or more amino acids, 70 or more amino acids, 75 or more amino acids, 80 or more amino acids, 85 or more amino acids, 90 or more amino acids, 95 or more amino acids, 100 or more amino acids, 110 or more amino acids, 120 or more amino acids, 130 or more amino acids, 140 or more amino acids, 150 or more amino acids, 160 or more amino acids, 170 or more amino acids, 180 or more amino acids, 190 or more amino acids, 140 or more amino acids, 150 or more amino acids, 160 or more amino acids, 170 or more amino acids, 180 or more amino acids, 190 or more amino acids, 200 or more amino acids, 220 or more amino acids, 250 or more amino acids, 275 or more amino acids, 300 or more amino acids, 350 or more amino acids, 400 or more amino acids, 450 or more amino acids, or 500 or more amino acids.

In some embodiments, the therapeutic peptide is a fragment of a known protein. For example, the therapeutic peptide can have two or more amino acids, three or more amino acids, four or more amino acids, five or more amino acids, six or more amino acids, seven or more amino acids, eight or more amino acids, nine or more amino acids, 10 or more amino acids, 11 or more amino acids, 12 or more amino acids, 13 or more amino acids, 14 or more amino acids, 15 or more amino acids, 16 or more amino acids, 17 or more amino acids, 18 or more amino acids, 19 or more amino acids, 20 or more amino acids, 22 or more amino acids, 24 or more amino acids, 26 or more amino acids, 28 or more amino acids, 30 or more amino acids, 32 or more amino acids, 34 or more amino acids, 36 or more amino acids, 38 or more amino acids, 40 or more amino acids, 42 or more amino acids, 44 or more amino acids, 46 or more amino acids, 48 or more amino acids, 50 or more amino acids, 55 or more amino acids, 60 or more amino acids, 65 or more amino acids, 70 or more amino acids, 75 or more amino acids, 80 or more amino acids, 85 or more amino acids, 90 or more amino acids, 95 or more amino acids, 100 or more amino acids, 110 or more amino acids, 120 or more amino acids, 130 or more amino acids, 140 or more amino acids, 150 or more amino acids, 160 or more amino acids, 170 or more amino acids, 180 or more amino acids, 190 or more amino acids, 140 or more amino acids, 150 or more amino acids, 160 or more amino acids, 170 or more amino acids, 180 or more amino acids, 190 or more amino acids, 200 or more amino acids, 220 or more amino acids, 250 or more amino acids, 275 or more amino acids, 300 or more amino acids, 350 or more amino acids, 400 or more amino acids, 450 or more amino acids, or 500 or more amino acids from a contiguous fragment of a known protein.

In some embodiments, the sequence of a therapeutic peptide is based on and shared sequence identity with or similarity to a known protein. In some embodiments, the therapeutic peptide has at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60% or greater sequence identity with the amino acid sequence of a contiguous fragment of a known protein.

In some embodiments, the therapeutic peptide has at least about 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70% or greater sequence identity with the amino acid sequence of a contiguous fragment of a known protein.

In some embodiments, the therapeutic peptide has at least about 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79% or 80% or greater sequence identity with the amino acid sequence of a contiguous fragment of a known protein.

In some embodiments, the therapeutic peptide has at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, or 90% or greater sequence identity with the amino acid sequence of a contiguous fragment of a known protein.

In some embodiments, the therapeutic peptide has at least about t 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater sequence identity with the amino acid sequence of a contiguous fragment of a known protein.

In some embodiments, the therapeutic peptide can include any non-natural or modified amino acid residues known in the art. In advantageous embodiments, the non-natural or modified amino acid facilitates linkage to the modified Fc protein directly via a covalent bond or indirectly via a linker.

Exemplary non-natural amino acids and amino acid analogs that can be used in the therapeutic peptides include, but are not limited to, 2-aminobutyric acid, 2-aminoisobutyric acid, 3-(1-naphthyl)alanine, 3-(2-naphthyl)alanine, 3-methylhistidine, 3-pyridylalanine, 4-chlorophenylalanine, 4-fluorophenylalanine, 4-hydroxyproline, 5-hydroxylysine, alloisoleucine, citrulline, dehydroalanine, homoarginine, homocysteine, homoserine, hydroxyproline, N-acetylserine, N-formylmethionine, N-methylglycine, N-methylisoleucine, norleucine, N-alpha.-methylarginine, O-phosphoserine, ornithine, phenylglycine, pipecolinic acid, piperazic acid, pyroglutamine, sarcosine, valanine, β-alanine, and β-cyclohexylalanine. Further useful amino acids include those described herein.

In certain embodiments, therapeutic peptides can be a cytokine, a growth factor, a factor for regulating replication, transcription, or translation, or a homolog or analog thereof. In particular embodiments, exemplary therapeutic peptides include but are not limited to adrenomedullin (AM), argatroban, angiopoietin (Ang), autocrine motility factor, bone morphogenetic proteins (BMPs), brain-derived neurotrophic factor (BDNF), ciliary neurotrophic factor (CNTF), elafin, epidermal growth factor (EGF), erythropoietin (EPO), exendin-3, exendin-4, fibroblast growth factor (FGF), Glial cell line-derived neurotrophic factor (GDNF), gonadotropin, granulocyte colony-stimulating factor (G-CSF), granulocyte macrophage colony-stimulating factor (GM-CSF), growth differentiation factor-9 (GDF9), hepatocyte growth factor (HGF), hepatoma-derived growth factor (HDGF), insulin-like growth factor (IGF), interferon (IFN), migration-stimulating factor, myostatin (GDF-8), nerve growth factor (NGF) and other neurotrophins, platelet-derived growth factor (PDGF), prolactin, thrombopoietin (TPO), somatotropin, transforming growth factor alpha(TGF-α), transforming growth factor beta(TGF-β), tumor necrosis factor-alpha(TNF-α), vascular endothelial growth factor (VEGF), a wnt Signaling Pathway regulator of homolog thereof, placental growth factor (PlGF), fetal Bovine Somatotrophin (FBS), X-Linked inhibitor of apoptosis protein (XIAP), or an interleukin such as IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, viral IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, and IL-18,

Additional exemplary therapeutic peptides and methods for making the same are described in, for example, United States Patent Publication No. 2007/0003518 by Atkinson et al.; United States Patent Publication No. 2010/0285003 by Eggink et al.; United States Patent Publication No. 2004/0067888 by Tovey et al.; United States Patent Publication No. 20050267028 by M. Virji; and United States Patent Publication No. 2011/0166063 by Bossard et al.; each of which is hereby incorporated by reference herein in its entirety.

Conjugates of therapeutic peptides and modified Fc proteins can be prepared using any method known in the art and described herein. For example, in some embodiments, a therapeutic peptide can be attached to any non-natural amino acid of an Fc protein as described herein. In some embodiments, a therapeutic peptide containing a non-natural amino acid can be linked to a modified Fc protein at a non-natural amino acid of the Fc protein as described herein. Further, in some embodiments, modified Fc proteins can be coupled to any other reactive group, for instance, an amino, carboxy or thiol group, of a therapeutic peptide standard coupling reagents or linkers.

In some embodiments, a targeting moiety is included in an Fc protein conjugate to guide the therapeutic or diagnostic payload to a desired location within a subject. The term “targeting moiety” is used herein to refer to a molecular structure that helps the conjugates of the invention to localize to a targeting area, e.g., help enter a cell, or bind a receptor. Preferably, the targeting moiety comprises a vitamin, antibody, antigen, receptor, DNA, RNA, sialyl Lewis X antigen, hyaluronic acid, sugars, cell specific lectins, steroid or steroid derivative, RGD peptide, ligand for a cell surface receptor, serum component, or combinatorial molecule directed against various intra- or extracellular receptors. The targeting moiety can also comprise a lipid or a phospholipid. Exemplary phospholipids include, without limitation, phosphatidylcholines, phospatidylserine, phospatidylinositol, phospatidylglycerol, and phospatidylethanolamine. These lipids may be in the form of micelles or liposomes and the like. The targeting moiety may further comprise a detectable label or alternately a detectable label may serve as a targeting moiety. When the conjugate has a targeting group comprising a detectable label, the amount and/or distribution/location of the polymer and/or the moiety (e.g., active agent) to which the polymer is coupled can be determined by using a suitable detector. Such labels include, without limitation, fluorescers, chemiluminescers, moieties used in enzyme labeling, colorimetric (e.g., dyes), metal ions, radioactive moieties, gold particles, quantum dots, and the like.

In some embodiments, the payload is an anti-tubulin agent. Examples of anti-tubulin agents include, but are not limited to, taxanes (e.g., Taxol® (paclitaxel), Taxotere® (docetaxel)), T67 (Tularik) and vinca alkyloids (e.g., vincristine, vinblastine, vindesine, and vinorelbine). Other antitubulin agents include, for example, baccatin derivatives, taxane analogs, epothilones (e.g., epothilone A and B), nocodazole, colchicine and colcimid, estramustine, cryptophycins, cemadotin, maytansinoids, combretastatins, discodermolide, and eleutherobin.

In certain embodiments, the cytotoxic agent is a maytansinoid, another group of anti-tubulin agents. For example, in specific embodiments, the maytansinoid can be maytansine or DM-1 (ImmunoGen, Inc.; see also Chari et al., 1992, Cancer Res. 52:127-131).

In some embodiments, the payload is an auristatin, such as auristatin E or a derivative thereof. For example, the auristatin E derivative can be an ester formed between auristatin E and a keto acid. For example, auristatin E can be reacted with paraacetyl benzoic acid or benzoylvaleric acid to produce AEB and AEVB, respectively. Other typical auristatin derivatives include AFP, MMAF, and MMAE. The synthesis and structure of auristatin derivatives are described in U.S. Patent Application Publication Nos. 2003-0083263, 2005-0238649 and 2005-0009751; International Patent Publication No. WO 04/010957, International Patent Publication No. WO 02/088172, and U.S. Pat. Nos. 6,323,315; 6,239,104; 6,034,065; 5,780,588; 5,665,860; 5,663,149; 5,635,483; 5,599,902; 5,554,725; 5,530,097; 5,521,284; 5,504,191; 5,410,024; 5,138,036; 5,076,973; 4,986,988; 4,978,744; 4,879,278; 4,816,444; and 4,486,414.

In some embodiments, the payload comprises a radioisotope. In some embodiments, the payload does not comprise a radioisotope. In some embodiments, the payload is radioactive. In some embodiments, the payload is not radioactive. In some embodiments, the payload comprises a label or tag, created based on fluorescence (e.g., green or red fluorescent protein), resonance, luminescence (e.g., bioluminescence, chemiluminescence, electrochemiluminescence, crystalloluminescence, produced during crystallization, electroluminescence, cathodoluminescence, mechanoluminescence, triboluminescence, fractoluminescence, piezoluminescence, photoluminescence, fluorescence, phosphorescence, radioluminescence, a result of bombardment by ionizing radiation, sonoluminescence, thermoluminescence), chelation, metal (e.g., gold particles), quantum dots, moieties used in enzyme labeling, colorimetric (e.g., dyes), or any specific molecular binding interactions.

In some embodiments, when used for diagnostic or labeling purposes, the payload can be a non-biological moiety. In some embodiments, the payload can be a non-biological moiety attached to a biological moiety such as a peptide, a protein, a nucleic acid or a hybrid thereof.

In some embodiments, the payload is an antimetabolite. The antimetabolite can be, for example, a purine antagonist (e.g., azothioprine or mycophenolate mofetil), a dihydrofolate reductase inhibitor (e.g., methotrexate), acyclovir, ganciclovir, zidovudine, vidarabine, ribavarin, azidothymidine, cytidine arabinoside, amantadine, dideoxyuridine, iododeoxyuridine, poscarnet, or trifluridine.

In other embodiments, the payload is tacrolimus, cyclosporine, FU506 or rapamycin. In further embodiments, the payload is a drug such as aldesleukin, alemtuzumab, alitretinoin, allopurinol, altretamine, amifostine, anastrozole, arsenic trioxide, bexarotene, bexarotene, calusterone, capecitabine, celecoxib, cladribine, Darbepoetin alfa, Denileukin diftitox, dexrazoxane, dromostanolone propionate, epirubicin, Epoetin alfa, estramustine, exemestane, Filgrastim, floxuridine, fludarabine, fulvestrant, gemcitabine, gemtuzumab ozogamicin (MYLOTARG®), goserelin, idarubicin, ifosfamide, imatinib mesylate, Interferon alfa-2a, irinotecan, letrozole, leucovorin, levamisole, meclorethamine or nitrogen mustard, megestrol, mesna, methotrexate, methoxsalen, mitomycin C, mitotane, nandrolone phenpropionate, oprelvekin, oxaliplatin, pamidronate, pegademase, pegaspargase, pegfilgrastim, pentostatin, pipobroman, plicamycin, porfimer sodium, procarbazine, quinacrine, rasburicase, Rituximab, Sargramostim, streptozocin, tamoxifen, temozolomide, teniposide, testolactone, thioguanine, toremifene, Tositumomab, Trastuzumab (HERCEPTIN®), tretinoin, uracil mustard, valrubicin, vinblastine, vincristine, vinorelbine or zoledronate.

In some embodiments, the payload is an immunomodulatory agent. The immunomodulatory agent can be, for example, ganciclovir, etanercept, tacrolimus, cyclosporine, rapamycin, cyclophosphamide, azathioprine, mycophenolate mofetil or methotrexate. Alternatively, the immunomodulatory agent can be, for example, a glucocorticoid (e.g., cortisol or aldosterone) or a glucocorticoid analogue (e.g., prednisone or dexamethasone).

In some embodiments, the immunomodulatory agent is an anti-inflammatory agent, such as arylcarboxylic derivatives, pyrazole-containing derivatives, oxicam derivatives and nicotinic acid derivatives. Classes of anti-inflammatory agents include, for example, cyclooxygenase inhibitors, 5-lipoxygenase inhibitors, and leukotriene receptor antagonists.

Suitable cyclooxygenase inhibitors include meclofenamic acid, mefenamic acid, carprofen, diclofenac, diflunisal, fenbufen, fenoprofen, indomethacin, ketoprofen, nabumetone, sulindac, tenoxicam and tolmetin.

Suitable lipoxygenase inhibitors include redox inhibitors (e.g., catechol butane derivatives, nordihydroguaiaretic acid (NDGA), masoprocol, phenidone, lanopalen, indazolinones, naphazatrom, benzofuranol, alkylhydroxylamine), and non-redox inhibitors (e.g., hydroxythiazoles, methoxyalkylthiazoles, benzopyrans and derivatives thereof, methoxytetrahydropyran, boswellic acids and acetylated derivatives of boswellic acids, and quinolinemethoxyphenylacetic acids substituted with cycloalkyl radicals), and precursors of redox inhibitors.

Other suitable lipoxygenase inhibitors include antioxidants (e.g., phenols, propyl gallate, flavonoids and/or naturally occurring substrates containing flavonoids, hydroxylated derivatives of the flavones, flavonol, dihydroquercetin, luteolin, galangin, orobol, derivatives of chalcone, 4,2′,4′-trihydroxychalcone, ortho-aminophenols, N-hydroxyureas, benzofuranols, ebselen and species that increase the activity of the reducing selenoenzymes), iron chelating agents (e.g., hydroxamic acids and derivatives thereof, N-hydroxyureas, 2-benzyl-1-naphthol, catechols, hydroxylamines, carnosol trolox C, catechol, naphthol, sulfasalazine, zyleuton, 5-hydroxyanthranilic acid and 4-(omega-arylalkyl)phenylalkanoic acids), imidazole-containing compounds (e.g., ketoconazole and itraconazole), phenothiazines, and benzopyran derivatives.

Yet other suitable lipoxygenase inhibitors include inhibitors of eicosanoids (e.g., octadecatetraenoic, eicosatetraenoic, docosapentaenoic, eicosahexaenoic and docosahexaenoic acids and esters thereof, PGE1 (prostaglandin E1), PGA2 (prostaglandin A2), viprostol, 15-monohydroxyeicosatetraenoic, 15-monohydroxy-eicosatrienoic and 15-monohydroxyeicosapentaenoic acids, and leukotrienes B5, C5 and D5), compounds interfering with calcium flows, phenothiazines, diphenylbutylamines, verapamil, fuscoside, curcumin, chlorogenic acid, caffeic acid, 5,8,11,14-eicosatetrayenoic acid (ETYA), hydroxyphenylretinamide, Ionapalen, esculin, diethylcarbamazine, phenantroline, baicalein, proxicromil, thioethers, diallyl sulfide and di-(1-propenyl) sulfide.

Leukotriene receptor antagonists include calcitriol, ontazolast, Bayer Bay-x-1005, Ciba-Geigy CGS-25019C, ebselen, Leo Denmark ETH-615, Lilly LY-293111, Ono ONO-4057, Terumo TMK-688, Boehringer Ingleheim BI-RM-270, Lilly LY 213024, Lilly LY 264086, Lilly LY 292728, Ono ONO LB457, Pfizer 105696, Perdue Frederick PF 10042, Rhone-Poulenc Rorer RP 66153, SmithKline Beecham SB-201146, SmithKline Beecham SB-201993, SmithKline Beecham SB-209247, Searle SC-53228, Sumitamo SM 15178, American Home Products WAY 121006, Bayer Bay-o-8276, Warner-Lambert CI-987, Warner-Lambert CI-987BPC-15LY 223982, Lilly LY 233569, Lilly LY-255283, MacroNex MNX-160, Merck and Co. MK-591, Merck and Co. MK-886, Ono ONO-LB-448, Purdue Frederick PF-5901, Rhone-Poulenc Rorer RG14893, Rhone-Poulenc Rorer RP 66364, Rhone-Poulenc Rorer RP 69698, Shionoogi S-2474, Searle SC-41930, Searle SC-50505, Searle SC-51146, Searle SC-52798, SmithKline Beecham SK&F-104493, Leo Denmark SR-2566, Tanabe T-757 and Teijin TEI-1338.

Other useful drug payloads include chemical compounds useful in the treatment of cancer. Examples of chemotherapeutic agents include Erlotinib (TARCEVA®, Genentech/OSI Pharm.), Bortezomib (VELCADE®, Millennium Pharm.), Fulvestrant (FASLODEX®, AstraZeneca), Sutent (SU11248, Pfizer), Letrozole (FEMARA®, Novartis), Imatinib mesylate (GLEEVEC®, Novartis), PTK787/ZK 222584 (Novartis), Oxaliplatin (Eloxatin®, Sanofi), 5-FU (5-fluorouracil), Leucovorin, Rapamycin (Sirolimus, RAPAMUNE®, Wyeth), Lapatinib (TYKERB®, GSK572016, Glaxo Smith Kline), Lonafarnib (SCH 66336), Sorafenib (BAY43-9006, Bayer Labs), and Gefitinib (IRESSA®, AstraZeneca), AG1478, AG1571 (SU 5271; Sugen), alkylating agents such as thiotepa and CYTOXAN® cyclosphosphamide; alkyl sulfonates such as busulfan, improsulfan and piposulfan; aziridines such as benzodopa, carboquone, meturedopa, and uredopa; ethylenimines and methylamelamines including altretamine, triethylenemelamine, triethyl enephosphoramide, triethylenethiophosphoramide and trimethylomelamine; acetogenins (especially bullatacin and bullatacinone); a camptothecin (including the synthetic analog topotecan); bryostatin; callystatin; CC-1065 (including its adozelesin, carzelesin and bizelesin synthetic analogs); cryptophycins (particularly cryptophycin 1 and cryptophycin 8); dolastatin; duocarmycin (including the synthetic analogs, KW-2189 and CB1-TM1); eleutherobin; pancratistatin; a sarcodictyin; spongistatin; nitrogen mustards such as chlorambucil, chlomaphazine, chlorophosphamide, estramustine, ifosfamide, mechlorethamine, mechlorethamine oxide hydrochloride, melphalan, novembichin, phenesterine, prednimustine, trofosfamide, uracil mustard; nitrosureas such as carmustine, chlorozotocin, fotemustine, lomustine, nimustine, and ranimnustine; antibiotics such as the enediyne antibiotics (e.g., calicheamicin, especially calicheamicin gammall and calicheamicin omegall (Angew Chem. Intl. Ed. Engl. (1994) 33:183-186); dynemicin, including dynemicin A; bisphosphonates, such as clodronate; an esperamicin; as well as neocarzinostatin chromophore and related chromoprotein enediyne antibiotic chromophores), aclacinomysins, actinomycin, authramycin, azaserine, bleomycins, cactinomycin, carabicin, caminomycin, carzinophilin, chromomycinis, dactinomycin, daunorubicin, detorubicin, 6-diazo-5-oxo-L-norleucine, ADRIAMYCIN® (doxorubicin), morpholino-doxorubicin, cyanomorpholino-doxorubicin, 2-pyrrolino-doxorubicin and deoxydoxorubicin), epirubicin, esorubicin, idarubicin, marcellomycin, mitomycins such as mitomycin C, mycophenolic acid, nogalamycin, olivomycins, peplomycin, porfiromycin, puromycin, quelamycin, rodorubicin, streptonigrin, streptozocin, tubercidin, ubenimex, zinostatin, zorubicin; anti-metabolites such as methotrexate and 5-fluorouracil (5-FU); folic acid analogs such as denopterin, methotrexate, pteropterin, trimetrexate; purine analogs such as fludarabine, 6-mercaptopurine, thiamniprine, thioguanine; pyrimidine analogs such as ancitabine, azacitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine, doxifluridine, enocitabine, floxuridine; androgens such as calusterone, dromostanolone propionate, epitiostanol, mepitiostane, testolactone; anti-adrenals such as aminoglutethimide, mitotane, trilostane; folic acid replenisher such as frolinic acid; aceglatone; aldophosphamide glycoside; aminolevulinic acid; eniluracil; amsacrine; bestrabucil; bisantrene; edatraxate; defofamine; demecolcine; diaziquone; elformithine; elliptinium acetate; an epothilone; etoglucid; gallium nitrate; hydroxyurea; lentinan; lonidainine; maytansinoids such as maytansine and ansamitocins; mitoguazone; mitoxantrone; mopidanmol; nitraerine; pentostatin; phenamet; pirarubicin; losoxantrone; podophyllinic acid; 2-ethylhydrazide; procarbazine; PSK® polysaccharide complex (JHS Natural Products, Eugene, Oreg.); razoxane; rhizoxin; sizofuran; spirogermanium; tenuazonic acid; triaziquone; 2,2′,2″-trichlorotriethylamine; trichothecenes (especially T-2 toxin, verracurin A, roridin A and anguidine); urethan; vindesine; dacarbazine; mannomustine; mitobronitol; mitolactol; pipobroman; gacytosine; arabinoside (“Ara-C”); cyclophosphamide; thiotepa; taxoids, e.g., TAXOL® (paclitaxel; Bristol-Myers Squibb Oncology, Princeton, N.J.), ABRAXANE® (Cremophor-free), albumin-engineered nanoparticle formulations of paclitaxel (American Pharmaceutical Partners, Schaumberg, Ill.), and TAXOTERE® (doxetaxel; Rhone-Poulenc Rorer, Antony, France); chloranmbucil; GEMZAR® (gemcitabine); 6-thioguanine; mercaptopurine; methotrexate; platinum analogs such as cisplatin and carboplatin; vinblastine; etoposide (VP-16); ifosfamide; mitoxantrone; vincristine; NAVELBINE® (vinorelbine); novantrone; teniposide; edatrexate; daunomycin; aminopterin; capecitabine (XELODA®); ibandronate; CPT-11; topoisomerase inhibitor RFS 2000; difluoromethylornithine (DMFO); retinoids such as retinoic acid; and pharmaceutically acceptable salts, acids and derivatives of any of the above.

Other useful payloads include: (i) anti-hormonal agents that act to regulate or inhibit hormone action on tumors such as anti-estrogens and selective estrogen receptor modulators (SERMs), including, for example, tamoxifen (including NOLVADEX®; tamoxifen citrate), raloxifene, droloxifene, 4-hydroxytamoxifen, trioxifene, keoxifene, LY117018, onapristone, and FARESTON® (toremifine citrate); (ii) aromatase inhibitors that inhibit the enzyme aromatase, which regulates estrogen production in the adrenal glands, such as, for example, 4(5)-imidazoles, aminoglutethimide, MEGASE® (megestrol acetate), AROMASIN® (exemestane; Pfizer), formestanie, fadrozole, RIVISOR® (vorozole), FEMARA® (letrozole; Novartis), and ARIMIDEX® (anastrozole; AstraZeneca); (iii) anti-androgens such as flutamide, nilutamide, bicalutamide, leuprolide, and goserelin; as well as troxacitabine (a 1,3-dioxolane nucleoside cytosine analog); (iv) protein kinase inhibitors; (v) lipid kinase inhibitors; (vi) antisense oligonucleotides, particularly those which inhibit expression of genes in signaling pathways implicated in aberrant cell proliferation, such as, for example, PKC-alpha, Ralf and H-Ras; (vii) ribozymes such as VEGF expression inhibitors (e.g., ANGIOZYME®) and HER2 expression inhibitors; (viii) vaccines such as gene therapy vaccines, for example, ALLOVECTIN®, LEUVECTIN®, and VAXID®; PROLEUKIN® rIL-2; a topoisomerase 1 inhibitor such as LURTOTECAN®; ABARELIX® rmRH; (ix) anti-angiogenic agents such as bevacizumab (AVASTIN®, Genentech); and (x) pharmaceutically acceptable salts, acids and derivatives of any of the above. Other anti-angiogenic agents include MMP-2 (matrix-metalloproteinase 2) inhibitors, MMP-9 (matrix-metalloproteinase 9) inhibitors, COX-II (cyclooxygenase II) inhibitors, and VEGF receptor tyrosine kinase inhibitors. Examples of such useful matrix metalloproteinase inhibitors that can be used in combination with the present compounds/compositions are described in WO 96/33172, WO 96/27583, EP 818442, EP 1004578, WO 98/07697, WO 98/03516, WO 98/34918, WO 98/34915, WO 98/33768, WO 98/30566, EP 606,046, EP 931,788, WO 90/05719, WO 99/52910, WO 99/52889, WO 99/29667, WO 99/07675, EP 945864, U.S. Pat. No. 5,863,949, U.S. Pat. No. 5,861,510, and EP 780,386, all of which are incorporated herein in their entireties by reference. Examples of VEGF receptor tyrosine kinase inhibitors include 4-(4-bromo-2-fluoroanilino)-6-methoxy-7-(1-methylpiperidin-4-ylmethoxy)qu-inazoline (ZD6474; Example 2 within WO 01/32651), 4-(4-fluoro-2-methylindol-5-yloxy)-6-methoxy-7-(3-pyrrolidin-1-ylpropoxy)-quinazoline (AZD2171; Example 240 within WO 00/47212), vatalanib (PTK787; WO 98/35985) and SU11248 (sunitinib; WO 01/60814), and compounds such as those disclosed in PCT Publication Nos. WO 97/22596, WO 97/30035, WO 97/32856, and WO 98/13354).

In certain embodiments, the payload is an antibody or an antibody fragment. In certain embodiments, the payload antibody or fragment can be encoded by any of the immunoglobulin genes recognized by those of skill in the art. The immunoglobulin genes include, but are not limited to, the κ, λ, α, γ (IgG1, IgG2, IgG3, and IgG4), δ, ε and μ constant region genes, as well as the immunoglobulin variable region genes. The term includes full-length antibodies and antibody fragments recognized by those of skill in the art, and variants thereof. Exemplary fragments include but are not limited to Fv, Fc, Fab, and (Fab′)₂, single chain Fv (scFv), diabodies, triabodies, tetrabodies, bifunctional hybrid antibodies, CDR1, CDR2, CDR3, combinations of CDR's, variable regions, framework regions, constant regions, and the like.

In certain embodiments, the payload can be one or more water-soluble polymers. A wide variety of macromolecular polymers and other molecules can be linked to antigen-binding polypeptides of the present invention to modulate biological properties of the antibody, and/or provide new biological properties to the antibody. These macromolecular polymers can be linked to the antibody via a naturally encoded amino acid, via a non-naturally encoded amino acid, or any functional substituent of a natural or non-natural amino acid, or any substituent or functional group added to a natural or non-natural amino acid. The molecular weight of the polymer may be of a wide range, including but not limited to, between about 100 Da and about 100,000 Da or more.

The polymer selected may be water soluble so that the protein to which it is attached does not precipitate in an aqueous environment, such as a physiological environment. The polymer may be branched or unbranched. Preferably, for therapeutic use of the end-product preparation, the polymer will be pharmaceutically acceptable.

The proportion of polyethylene glycol molecules to antibody molecules will vary, as will their concentrations in the reaction mixture. In general, the optimum ratio (in terms of efficiency of reaction in that there is minimal excess unreacted protein or polymer) may be determined by the molecular weight of the polyethylene glycol selected and on the number of available reactive groups available. As relates to molecular weight, typically the higher the molecular weight of the polymer, the fewer number of polymer molecules which may be attached to the protein. Similarly, branching of the polymer should be taken into account when optimizing these parameters. Generally, the higher the molecular weight (or the more branches) the higher the polymer:protein ratio.

The water soluble polymer may be any structural form including but not limited to linear, forked or branched. Typically, the water-soluble polymer is a poly(alkylene glycol), such as poly(ethylene glycol) (PEG), but other water-soluble polymers can also be employed. By way of example, PEG is used to describe certain embodiments of this invention.

PEG is a well-known, water soluble polymer that is commercially available or can be prepared by ring-opening polymerization of ethylene glycol according to methods well known in the art (Sandler and Karo, Polymer Synthesis, Academic Press, New York, Vol. 3, pages 138-161). The term “PEG” is used broadly to encompass any polyethylene glycol molecule, without regard to size or to modification at an end of the PEG, and can be represented as linked to the antibody by the formula: XO—(CH₂CH₂O)_(=n)—CH₂CH₂—Y where n is 2 to 10,000 and X is H or a terminal modification, including but not limited to, a C₁₋₄ alkyl.

In some cases, a PEG used in the invention terminates on one end with hydroxy or methoxy, i.e., X is H or CH₃ (“methoxy PEG”). Alternatively, the PEG can terminate with a reactive group, thereby forming a bifunctional polymer. Typical reactive groups can include those reactive groups that are commonly used to react with the functional groups found in the 20 common amino acids (including but not limited to, maleimide groups, activated carbonates (including but not limited to, p-nitrophenyl ester), activated esters (including but not limited to, N-hydroxysuccinimide, p-nitrophenyl ester) and aldehydes) as well as functional groups that are inert to the 20 common amino acids but that react specifically with complementary functional groups present in non-naturally encoded amino acids (including but not limited to, azide groups, alkyne groups). It is noted that the other end of the PEG, which is shown in the above formula by Y, will attach either directly or indirectly to an antigen-binding polypeptide via a naturally-occurring or non-naturally encoded amino acid. For instance, Y may be an amide, carbamate or urea linkage to an amine group (including but not limited to, the epsilon amine of lysine or the N-terminus) of the polypeptide. Alternatively, Y may be a maleimide linkage to a thiol group (including but not limited to, the thiol group of cysteine). Alternatively, Y may be a linkage to a residue not commonly accessible via the 20 common amino acids. For example, an azide group on the PEG can be reacted with an alkyne group on the antibody to form a Huisgen [3+2]cycloaddition product. Alternatively, an alkyne group on the PEG can be reacted with an azide group present in a non-naturally encoded amino acid to form a similar product. In some embodiments, a strong nucleophile (including but not limited to, hydrazine, hydrazide, hydroxylamine, semicarbazide) can be reacted with an aldehyde or ketone group present in a non-naturally encoded amino acid to form a hydrazone, oxime or semicarbazone, as applicable, which in some cases can be further reduced by treatment with an appropriate reducing agent. Alternatively, the strong nucleophile can be incorporated into the antibody via a non-naturally encoded amino acid and used to react preferentially with a ketone or aldehyde group present in the water soluble polymer.

Any molecular mass for a PEG can be used as practically desired, including but not limited to, from about 100 Daltons (Da) to 100,000 Da or more as desired (including but not limited to, sometimes 0.1-50 kDa or 10-40 kDa). Branched chain PEGs, including but not limited to, PEG molecules with each chain having a MW ranging from 1-100 kDa (including but not limited to, 1-50 kDa or 5-20 kDa) can also be used. A wide range of PEG molecules are described in, including but not limited to, the Shearwater Polymers, Inc. catalog, Nektar Therapeutics catalog, incorporated herein by reference.

Generally, at least one terminus of the PEG molecule is available for reaction with the non-naturally-encoded amino acid. For example, PEG derivatives bearing alkyne and azide moieties for reaction with amino acid side chains can be used to attach PEG to non-naturally encoded amino acids as described herein. If the non-naturally encoded amino acid comprises an azide, then the PEG will typically contain either an alkyne moiety to effect formation of the [3+2] cycloaddition product or an activated PEG species (i.e., ester, carbonate) containing a phosphine group to effect formation of the amide linkage. Alternatively, if the non-naturally encoded amino acid comprises an alkyne, then the PEG will typically contain an azide moiety to effect formation of the [3+2] Huisgen cycloaddition product. If the non-naturally encoded amino acid comprises a carbonyl group, the PEG will typically comprise a potent nucleophile (including but not limited to, a hydrazide, hydrazine, hydroxylamine, or semicarbazide functionality) in order to effect formation of corresponding hydrazone, oxime, and semicarbazone linkages, respectively. In other alternatives, a reverse of the orientation of the reactive groups described above can be used, i.e., an azide moiety in the non-naturally encoded amino acid can be reacted with a PEG derivative containing an alkyne.

In some embodiments, the Fc protein variant with a PEG derivative contains a chemical functionality that is reactive with the chemical functionality present on the side chain of the non-naturally encoded amino acid.

In certain embodiments, the payload is an azide- or acetylene-containing polymer comprising a water soluble polymer backbone having an average molecular weight from about 800 Da to about 100,000 Da. The polymer backbone of the water-soluble polymer can be poly(ethylene glycol). However, it should be understood that a wide variety of water soluble polymers including but not limited to poly(ethylene)glycol and other related polymers, including poly(dextran) and poly(propylene glycol), are also suitable for use in the practice of this invention and that the use of the term PEG or poly(ethylene glycol) is intended to encompass and include all such molecules. The term PEG includes, but is not limited to, poly(ethylene glycol) in any of its forms, including bifunctional PEG, multiarmed PEG, derivatized PEG, forked PEG, branched PEG, pendent PEG (i.e. PEG or related polymers having one or more functional groups pendent to the polymer backbone), or PEG with degradable linkages therein.

The polymer backbone can be linear or branched. Branched polymer backbones are generally known in the art. Typically, a branched polymer has a central branch core moiety and a plurality of linear polymer chains linked to the central branch core. PEG is commonly used in branched forms that can be prepared by addition of ethylene oxide to various polyols, such as glycerol, glycerol oligomers, pentaerythritol and sorbitol. The central branch moiety can also be derived from several amino acids, such as lysine. The branched poly(ethylene glycol) can be represented in general form as R(-PEG-OH)_(m) in which R is derived from a core moiety, such as glycerol, glycerol oligomers, or pentaerythritol, and m represents the number of arms. Multi-armed PEG molecules, such as those described in U.S. Pat. Nos. 5,932,462, 5,643,575; 5,229,490; 4,289,872; U.S. Pat. Appl. 2003/0143596; WO 96/21469; and WO 93/21259, each of which is incorporated by reference herein in its entirety, can also be used as the polymer backbone.

Branched PEG can also be in the form of a forked PEG represented by PEG(-YCHZ₂)_(n), where Y is a linking group and Z is an activated terminal group linked to CH by a chain of atoms of defined length.

Yet another branched form, the pendant PEG, has reactive groups, such as carboxyl, along the PEG backbone rather than at the end of PEG chains.

In addition to these forms of PEG, the polymer can also be prepared with weak or degradable linkages in the backbone. For example, PEG can be prepared with ester linkages in the polymer backbone that are subject to hydrolysis. As shown below, this hydrolysis results in cleavage of the polymer into fragments of lower molecular weight: -PEG-CO₂—PEG-+H₂O→PEG-CO₂H+HO-PEG- It is understood by those skilled in the art that the term poly(ethylene glycol) or PEG represents or includes all the forms known in the art including but not limited to those disclosed herein.

Many other polymers are also suitable for use in the present invention. In some embodiments, polymer backbones that are water-soluble, with from 2 to about 300 termini, are particularly useful in the invention. Examples of suitable polymers include, but are not limited to, other poly(alkylene glycols), such as poly(propylene glycol) (“PPG”), copolymers thereof (including but not limited to copolymers of ethylene glycol and propylene glycol), terpolymers thereof, mixtures thereof, and the like. Although the molecular weight of each chain of the polymer backbone can vary, it is typically in the range of from about 800 Da to about 100,000 Da, often from about 6,000 Da to about 80,000 Da.

Those of ordinary skill in the art will recognize that the foregoing list for substantially water soluble backbones is by no means exhaustive and is merely illustrative, and that all polymeric materials having the qualities described above are contemplated as being suitable for use in the present invention.

In some embodiments of the present invention the polymer derivatives are “multi-functional”, meaning that the polymer backbone has at least two termini, and possibly as many as about 300 termini, functionalized or activated with a functional group. Multifunctional polymer derivatives include, but are not limited to, linear polymers having two termini, each terminus being bonded to a functional group which may be the same or different.

In one embodiment, the polymer derivative has the structure: X-A-POLY-B —N═N═N wherein: N═N═N is an azide moiety; B is a linking moiety, which may be present or absent; POLY is a water-soluble non-antigenic polymer; A is a linking moiety, which may be present or absent and which may be the same as B or different; and X is a second functional group. Examples of a linking moiety for A and B include, but are not limited to, a multiply-functionalized alkyl group containing up to 18, and more preferably between 1-10 carbon atoms. A heteroatom such as nitrogen, oxygen or sulfur may be included with the alkyl chain. The alkyl chain may also be branched at a heteroatom. Other examples of a linking moiety for A and B include, but are not limited to, a multiply functionalized aryl group, containing up to 10 and more preferably 5-6 carbon atoms. The aryl group may be substituted with one more carbon atoms, nitrogen, oxygen or sulfur atoms. Other examples of suitable linking groups include those linking groups described in U.S. Pat. Nos. 5,932,462; 5,643,575; and U.S. Pat. Appl. Publication 2003/0143596, each of which is incorporated by reference herein. Those of ordinary skill in the art will recognize that the foregoing list for linking moieties is by no means exhaustive and is merely illustrative, and that all linking moieties having the qualities described above are contemplated to be suitable for use in the present invention.

Examples of suitable functional groups for use as X include, but are not limited to, hydroxyl, protected hydroxyl, alkoxyl, active ester, such as N-hydroxysuccinimidyl esters and 1-benzotriazolyl esters, active carbonate, such as N-hydroxysuccinimidyl carbonates and 1-benzotriazolyl carbonates, acetal, aldehyde, aldehyde hydrates, alkenyl, acrylate, methacrylate, acrylamide, active sulfone, amine, aminooxy, protected amine, hydrazide, protected hydrazide, protected thiol, carboxylic acid, protected carboxylic acid, isocyanate, isothiocyanate, maleimide, vinylsulfone, dithiopyridine, vinylpyridine, iodoacetamide, epoxide, glyoxals, diones, mesylates, tosylates, tresylate, alkene, ketone, and azide. As is understood by those skilled in the art, the selected X moiety should be compatible with the azide group so that reaction with the azide group does not occur. The azide-containing polymer derivatives may be homobifunctional, meaning that the second functional group (i.e., X) is also an azide moiety, or heterobifunctional, meaning that the second functional group is a different functional group.

The term “protected” refers to the presence of a protecting group or moiety that prevents reaction of the chemically reactive functional group under certain reaction conditions. The protecting group will vary depending on the type of chemically reactive group being protected. For example, if the chemically reactive group is an amine or a hydrazide, the protecting group can be selected from the group of tert-butyloxycarbonyl (t-Boc) and 9-fluorenylmethoxycarbonyl (Fmoc). If the chemically reactive group is a thiol, the protecting group can be orthopyridyldisulfide. If the chemically reactive group is a carboxylic acid, such as butanoic or propionic acid, or a hydroxyl group, the protecting group can be benzyl or an alkyl group such as methyl, ethyl, or tert-butyl. Other protecting groups known in the art may also be used in the present invention.

Specific examples of terminal functional groups in the literature include, but are not limited to, N-succinimidyl carbonate (see e.g., U.S. Pat. Nos. 5,281,698, 5,468,478), amine (see, e.g., Buckmann et al. Makromol. Chem. 182:1379 (1981), Zaplipsky et al. Eur. Polym. J. 19:1177 (1983)), hydrazide (See, e.g., Andresz et al. Makromol. Chem. 179:301 (1978)), succinimidyl propionate and succinimidyl butanoate (see, e.g., Olson et al. in Poly(ethylene glycol) Chemistry & Biological Applications, pp 170-181, Harris & Zaplipsky Eds., ACS, Washington, D.C., 1997; see also U.S. Pat. No. 5,672,662), succinimidyl succinate (See, e.g., Abuchowski et al. Cancer Biochem. Biophys. 7:175 (1984) and Joppich et al. Macrolol. Chem. 180:1381 (1979), succinimidyl ester (see, e.g., U.S. Pat. No. 4,670,417), benzotriazole carbonate (see, e.g., U.S. Pat. No. 5,650,234), glycidyl ether (see, e.g., Pitha et al. Eur. J Biochem. 94:11 (1979), Elling et al., Biotech. Appl. Biochem. 13:354 (1991), oxycarbonylimidazole (see, e.g., Beauchamp, et al., Anal. Biochem. 131:25 (1983), Tondelli et al. J. Controlled Release 1:251 (1985)), p-nitrophenyl carbonate (see, e.g., Veronese, et al., Appl. Biochem. Biotech., 11: 141 (1985); and Sartore et al., Appl. Biochem. Biotech., 27:45 (1991)), aldehyde (see, e.g., Harris et al. J. Polym. Sci. Chem. Ed. 22:341 (1984), U.S. Pat. No. 5,824,784, U.S. Pat. No. 5,252,714), maleimide (see, e.g., Goodson et al. Bio/Technology 8:343 (1990), Romani et al. in Chemistry of Peptides and Proteins 2:29 (1984)), and Kogan, Synthetic Comm. 22:2417 (1992)), orthopyridyl-disulfide (see, e.g., Woghiren, et al. Bioconj. Chem. 4:314(1993)), acrylol (see, e.g., Sawhney et al., Macromolecules, 26:581 (1993)), vinylsulfone (see, e.g., U.S. Pat. No. 5,900,461). All of the above references and patents are incorporated herein by reference.

In certain embodiments of the present invention, the polymer derivatives of the invention comprise a polymer backbone having the structure: X—CH₂CH₂O—(CH₂CH₂O)_(n)—CH₂CH₂—N═N═N wherein: X is a functional group as described above; and n is about 20 to about 4000. In another embodiment, the polymer derivatives of the invention comprise a polymer backbone having the structure: X—CH₂CH₂O—(CH₂CH₂O)_(n)—CH₂CH₂—O—(CH-₂)_(m)—W—N═N═N wherein: W is an aliphatic or aromatic linker moiety comprising between 1-10 carbon atoms; n is about 20 to about 4000; X is a functional group as described above; and m is between 1 and 10.

The azide-containing PEG derivatives of the invention can be prepared by a variety of methods known in the art and/or disclosed herein. In one method, shown below, a water soluble polymer backbone having an average molecular weight from about 800 Da to about 100,000 Da, the polymer backbone having a first terminus bonded to a first functional group and a second terminus bonded to a suitable leaving group, is reacted with an azide anion (which may be paired with any of a number of suitable counter-ions, including sodium, potassium, tert-butylammonium and so forth). The leaving group undergoes a nucleophilic displacement and is replaced by the azide moiety, affording the desired azide-containing PEG polymer as shown in the following: X-PEG-L+N₃ ⁻→X-PEG-N₃.

As shown, a suitable polymer backbone for use in the present invention has the formula X-PEG-L, wherein PEG is poly(ethylene glycol) and X is a functional group that does not react with azide groups and L is a suitable leaving group. Examples of suitable functional groups include, but are not limited to, hydroxyl, protected hydroxyl, acetal, alkenyl, amine, aminooxy, protected amine, protected hydrazide, protected thiol, carboxylic acid, protected carboxylic acid, maleimide, dithiopyridine, and vinylpyridine, and ketone. Examples of suitable leaving groups include, but are not limited to, chloride, bromide, iodide, mesylate, tresylate, and tosylate.

In another method for preparation of the azide-containing polymer derivatives of the present invention, a linking agent bearing an azide functionality is contacted with a water soluble polymer backbone having an average molecular weight from about 800 Da to about 100,000 Da, wherein the linking agent bears a chemical functionality that will react selectively with a chemical functionality on the PEG polymer, to form an azide-containing polymer derivative product wherein the azide is separated from the polymer backbone by a linking group.

An exemplary reaction scheme is shown below: X-PEG-M+N-linker-N═N═N→PG-X-PEG-linker-N═N═N wherein: PEG is poly(ethylene glycol) and X is a capping group such as alkoxy or a functional group as described above; and M is a functional group that is not reactive with the azide functionality but that will react efficiently and selectively with the N functional group.

Examples of suitable functional groups include, but are not limited to, M being a carboxylic acid, carbonate or active ester if N is an amine; M being a ketone if N is a hydrazide or aminooxy moiety; M being a leaving group if N is a nucleophile.

Purification of the crude product may be accomplished by known methods including, but are not limited to, precipitation of the product followed by chromatography, if necessary.

A more specific example is shown below in the case of PEG diamine, in which one of the amines is protected by a protecting group moiety such as tert-butyl-Boc and the resulting mono-protected PEG diamine is reacted with a linking moiety that bears the azide functionality: BocHN-PEG-NH₂+HO₂C—(CH₂)₃—N═N═N.

In this instance, the amine group can be coupled to the carboxylic acid group using a variety of activating agents such as thionyl chloride or carbodiimide reagents and N-hydroxysuccinimide or N-hydroxybenzotriazole to create an amide bond between the monoamine PEG derivative and the azide-bearing linker moiety. After successful formation of the amide bond, the resulting N-tert-butyl-Boc-protected azide-containing derivative can be used directly to modify bioactive molecules or it can be further elaborated to install other useful functional groups. For instance, the N-t-Boc group can be hydrolyzed by treatment with strong acid to generate an omega-amino-PEG-azide. The resulting amine can be used as a synthetic handle to install other useful functionality such as maleimide groups, activated disulfides, activated esters and so forth for the creation of valuable heterobifunctional reagents.

Heterobifunctional derivatives are particularly useful when it is desired to attach different molecules to each terminus of the polymer. For example, the omega-N-amino-N-azido PEG would allow the attachment of a molecule having an activated electrophilic group, such as an aldehyde, ketone, activated ester, activated carbonate and so forth, to one terminus of the PEG and a molecule having an acetylene group to the other terminus of the PEG.

In another embodiment of the invention, the polymer derivative has the structure: X-A-POLY-B—C═C—R wherein: R can be either H or an alkyl, alkene, alkyoxy, or aryl or substituted aryl group; B is a linking moiety, which may be present or absent; POLY is a water-soluble non-antigenic polymer; A is a linking moiety, which may be present or absent and which may be the same as B or different; and X is a second functional group.

Examples of a linking moiety for A and B include, but are not limited to, a multiply-functionalized alkyl group containing up to 18, and more preferably between 1-10 carbon atoms. A heteroatom such as nitrogen, oxygen or sulfur may be included with the alkyl chain. The alkyl chain may also be branched at a heteroatom. Other examples of a linking moiety for A and B include, but are not limited to, a multiply functionalized aryl group, containing up to 10 and more preferably 5-6 carbon atoms. The aryl group may be substituted with one more carbon atoms, nitrogen, oxygen, or sulfur atoms. Other examples of suitable linking groups include those linking groups described in U.S. Pat. Nos. 5,932,462 and 5,643,575 and U.S. Pat. Appl. Publication 2003/0143596, each of which is incorporated by reference herein. Those of ordinary skill in the art will recognize that the foregoing list for linking moieties is by no means exhaustive and is intended to be merely illustrative, and that a wide variety of linking moieties having the qualities described above are contemplated to be useful in the present invention.

Examples of suitable functional groups for use as X include hydroxyl, protected hydroxyl, alkoxyl, active ester, such as N-hydroxysuccinimidyl esters and 1-benzotriazolyl esters, active carbonate, such as N-hydroxysuccinimidyl carbonates and 1-benzotriazolyl carbonates, acetal, aldehyde, aldehyde hydrates, alkenyl, acrylate, methacrylate, acrylamide, active sulfone, amine, aminooxy, protected amine, hydrazide, protected hydrazide, protected thiol, carboxylic acid, protected carboxylic acid, isocyanate, isothiocyanate, maleimide, vinylsulfone, dithiopyridine, vinylpyridine, iodoacetamide, epoxide, glyoxals, diones, mesylates, tosylates, and tresylate, alkene, ketone, and acetylene. As would be understood, the selected X moiety should be compatible with the acetylene group so that reaction with the acetylene group does not occur. The acetylene-containing polymer derivatives may be homobifunctional, meaning that the second functional group (i.e., X) is also an acetylene moiety, or heterobifunctional, meaning that the second functional group is a different functional group.

In another embodiment of the present invention, the polymer derivatives comprise a polymer backbone having the structure: X—CH₂CH₂O—(CH₂CH₂O)_(n)CH₂CH₂—O—(CH₂—O—(CH₂)_(m)—C═CH wherein: X is a functional group as described above; n is about 20 to about 4000; and m is between 1 and 10. Specific examples of each of the heterobifunctional PEG polymers are shown below.

The acetylene-containing PEG derivatives of the invention can be prepared using methods known to those skilled in the art and/or disclosed herein. In one method, a water soluble polymer backbone having an average molecular weight from about 800 Da to about 100,000 Da, the polymer backbone having a first terminus bonded to a first functional group and a second terminus bonded to a suitable nucleophilic group, is reacted with a compound that bears both an acetylene functionality and a leaving group that is suitable for reaction with the nucleophilic group on the PEG. When the PEG polymer bearing the nucleophilic moiety and the molecule bearing the leaving group are combined, the leaving group undergoes a nucleophilic displacement and is replaced by the nucleophilic moiety, affording the desired acetylene-containing polymer: X-PEG-Nu+L-A-C→X-PEG-Nu-A-C═CR′.

As shown, a preferred polymer backbone for use in the reaction has the formula X-PEG-Nu, wherein PEG is poly(ethylene glycol), Nu is a nucleophilic moiety and X is a functional group that does not react with Nu, L or the acetylene functionality.

Examples of Nu include, but are not limited to, amine, alkoxy, aryloxy, sulfhydryl, imino, carboxylate, hydrazide, aminoxy groups that would react primarily via a SN2-type mechanism. Additional examples of Nu groups include those functional groups that would react primarily via a nucleophilic addition reaction. Examples of L groups include chloride, bromide, iodide, mesylate, tresylate, and tosylate and other groups expected to undergo nucleophilic displacement as well as ketones, aldehydes, thioesters, olefins, alpha-beta unsaturated carbonyl groups, carbonates and other electrophilic groups expected to undergo addition by nucleophiles.

In another embodiment of the present invention, A is an aliphatic linker of between 1-10 carbon atoms or a substituted aryl ring of between 6-14 carbon atoms. X is a functional group which does not react with azide groups and L is a suitable leaving group.

In another method for preparation of the acetylene-containing polymer derivatives of the invention, a PEG polymer having an average molecular weight from about 800 Da to about 100,000 Da, bearing either a protected functional group or a capping agent at one terminus and a suitable leaving group at the other terminus is contacted by an acetylene anion.

Water soluble polymers can be linked to the Fc proteins of the invention. The water soluble polymers may be linked via a non-naturally encoded amino acid incorporated in the Fc proteins or any functional group or substituent of a non-naturally encoded or naturally encoded amino acid, or any functional group or substituent added to a non-naturally encoded or naturally encoded amino acid. Alternatively, the water soluble polymers are linked to an antigen-binding polypeptide incorporating a non-naturally encoded amino acid via a naturally-occurring amino acid (including but not limited to, cysteine, lysine or the amine group of the N-terminal residue). In some cases, the Fc proteins of the invention comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 non-natural amino acids, wherein one or more non-naturally-encoded amino acid(s) are linked to water soluble polymer(s) (including but not limited to, PEG and/or oligosaccharides). In some cases, the Fc proteins of the invention further comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more naturally-encoded amino acid(s) linked to water soluble polymers. In some cases, the Fc protein of the invention comprise one or more non-naturally encoded amino acid(s) linked to water soluble polymers and one or more naturally-occurring amino acids linked to water soluble polymers. In some embodiments, the water soluble polymers used in the present invention enhance the serum half-life of the Fc proteins relative to the unconjugated form.

The number of water soluble polymers linked to an antigen-binding polypeptide (i.e., the extent of PEGylation or glycosylation) of the present invention can be adjusted to provide an altered (including but not limited to, increased or decreased) pharmacologic, pharmacokinetic or pharmacodynamic characteristic such as in vivo half-life. In some embodiments, the half-life of Fc protein is increased at least about 10, 20, 30, 40, 50, 60, 70, 80, 90 percent, 2-fold, 5-fold, 10-fold, 50-fold, or at least about 100-fold over an unmodified polypeptide.

In one embodiment of the present invention, an antigen-binding polypeptide comprising a carbonyl-containing non-naturally encoded amino acid is modified with a PEG derivative that contains a terminal hydrazine, hydroxylamine, hydrazide or semicarbazide moiety that is linked directly to the PEG backbone.

In some embodiments, the hydroxylamine-terminal PEG derivative will have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)_(m)—O—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000 (i.e., average molecular weight is between 5-40 kDa).

In some embodiments, the hydrazine- or hydrazide-containing PEG derivative will have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)_(m)—X—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000 and X is optionally a carbonyl group (C═O) that can be present or absent.

In some embodiments, the semicarbazide-containing PEG derivative will have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)_(m)—NH—C(O)—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000.

In another embodiment of the invention, an antigen-binding polypeptide comprising a carbonyl-containing amino acid is modified with a PEG derivative that contains a terminal hydroxylamine, hydrazide, hydrazine, or semicarbazide moiety that is linked to the PEG backbone by means of an amide linkage.

In some embodiments, the hydroxylamine-terminal PEG derivatives have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)(CH₂)_(m)—O—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000 (i.e., average molecular weight is between 5-40 kDa).

In some embodiments, the hydrazine- or hydrazide-containing PEG derivatives have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)(CH₂)_(m)—X—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10, n is 100-1,000 and X is optionally a carbonyl group (C═O) that can be present or absent.

In some embodiments, the semicarbazide-containing PEG derivatives have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)(CH₂)_(m)—NH—C(O)—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000.

In another embodiment of the invention, an Fc protein comprising a carbonyl-containing amino acid is modified with a branched PEG derivative that contains a terminal hydrazine, hydroxylamine, hydrazide or semicarbazide moiety, with each chain of the branched PEG having a MW ranging from 10-40 kDa and, more preferably, from 5-20 kDa.

In another embodiment of the invention, an Fc protein comprising a non-naturally encoded amino acid is modified with a PEG derivative having a branched structure. For instance, in some embodiments, the hydrazine- or hydrazide-terminal PEG derivative will have the following structure: [RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)]₂CH(CH₂)_(m)—X—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000, and X is optionally a carbonyl group (C═O) that can be present or absent.

In some embodiments, the PEG derivatives containing a semicarbazide group will have the structure: [RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—C(O)—NH—CH₂—CH₂]2CH—X—(CH₂)_(m)NH—C(O)—NH—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), X is optionally NH, O, S, C(O) or not present, m is 2-10 and n is 100-1,000.

In some embodiments, the PEG derivatives containing a hydroxylamine group will have the structure: [RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—C(O)—NH—CH₂—CH₂]2CH—X—(CH₂)_(m)—O—NH₂ where R is a simple alkyl (methyl, ethyl, propyl, etc.), X is optionally NH, O, S, C(O) or not present, m is 2-10 and n is 100-1,000.

The degree and sites at which the water soluble polymer(s) are linked to the Fc proteins can modulate the binding of the Fc proteins to an antigen or receptor.

Methods and chemistry for activation of polymers as well as for conjugation of peptides are described in the literature and are known in the art. Commonly used methods for activation of polymers include, but are not limited to, activation of functional groups with cyanogen bromide, periodate, glutaraldehyde, biepoxides, epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides, trichlorotriazine, etc. (see, R. F. Taylor, (1991), PROTEIN IMMOBILISATION. FUNDAMENTAL AND APPLICATIONS, Marcel Dekker, N.Y.; S. S. Wong, (1992), CHEMISTRY OF PROTEIN CONJUGATION AND CROSSLINKING, CRC Press, Boca Raton; G. T. Hermanson et al., (1993), IMMOBILIZED AFFINITY LIGAND TECHNIQUES, Academic Press, N.Y.; Dunn, R. L., et al., Eds. POLYMERIC DRUGS AND DRUG DELIVERY SYSTEMS, ACS Symposium Series Vol. 469, American Chemical Society, Washington, D.C. 1991).

Several reviews and monographs on the functionalization and conjugation of PEG are available. See, for example, Harris, Macronol. Chem. Phys. C25: 325-373 (1985); Scouten, Methods in Enzymology 135: 30-65 (1987); Wong et al., Enzyme Microb. Technol. 14: 866-874 (1992); Delgado et al., Critical Reviews in Therapeutic Drug Carrier Systems 9: 249-304 (1992); Zalipsky, Bioconjugate Chem. 6: 150-165 (1995).

Methods for activation of polymers can also be found in WO 94/17039, U.S. Pat. No. 5,324,844, WO 94/18247, WO 94/04193, U.S. Pat. No. 5,219,564, U.S. Pat. No. 5,122,614, WO 90/13540, U.S. Pat. No. 5,281,698, and WO 93/15189, and for conjugation between activated polymers and enzymes including but not limited to Coagulation Factor VIII (WO 94/15625), hemoglobin (WO 94/09027), oxygen carrying molecule (U.S. Pat. No. 4,412,989), ribonuclease and superoxide dismutase (Veronese at al., App. Biochem. Biotech. 11: 141-45 (1985)). All references and patents cited are incorporated by reference herein.

PEGylation (i.e., addition of any water soluble polymer) of antigen-binding polypeptides containing a non-naturally encoded amino acid, such as p-azido-L-phenylalanine, is carried out by any convenient method. For example, Fc protein is PEGylated with an alkyne-terminated mPEG derivative. Briefly, an excess of solid mPEG(5000)-O—CH₂—C═CH is added, with stirring, to an aqueous solution of p-azido-L-Phe-containing Fc protein at room temperature. Typically, the aqueous solution is buffered with a buffer having a pK_(a) near the pH at which the reaction is to be carried out (generally about pH 4-10). Examples of suitable buffers for PEGylation at pH 7.5, for instance, include, but are not limited to, HEPES, phosphate, borate, TRIS-HCl, EPPS, and TES. The pH is continuously monitored and adjusted if necessary. The reaction is typically allowed to continue for between about 1-48 hours.

The reaction products are subsequently subjected to hydrophobic interaction chromatography to separate the PEGylated Fc protein variants from free mPEG(5000)-O—CH₂—C═CH and any high-molecular weight complexes of the pegylated Fc protein which may form when unblocked PEG is activated at both ends of the molecule, thereby crosslinking Fc protein variant molecules. The conditions during hydrophobic interaction chromatography are such that free mPEG(5000)-O—CH₂—C═CH flows through the column, while any crosslinked PEGylated Fc protein variant complexes elute after the desired forms, which contain one Fc protein variant molecule conjugated to one or more PEG groups. Suitable conditions vary depending on the relative sizes of the cross-linked complexes versus the desired conjugates and are readily determined by those skilled in the art. The eluent containing the desired conjugates is concentrated by ultrafiltration and desalted by diafiltration.

If necessary, the PEGylated Fc proteins obtained from the hydrophobic chromatography can be purified further by one or more procedures known to those skilled in the art including, but are not limited to, affinity chromatography; anion- or cation-exchange chromatography (using, including but not limited to, DEAE SEPHAROSE); chromatography on silica; reverse phase HPLC; gel filtration (using, including but not limited to, SEPHADEX G-75); hydrophobic interaction chromatography; size-exclusion chromatography, metal-chelate chromatography; ultrafiltration/diafiltration; ethanol precipitation; ammonium sulfate precipitation; chromatofocusing; displacement chromatography; electrophoretic procedures (including but not limited to preparative isoelectric focusing), differential solubility (including but not limited to ammonium sulfate precipitation), or extraction. Apparent molecular weight may be estimated by GPC by comparison to globular protein standards (PROTEIN PURIFICATION METHODS, A PRACTICAL APPROACH (Harris & Angal, Eds.) IRL Press 1989, 293-306). The purity of the Fc protein-PEG conjugate can be assessed by proteolytic degradation (including but not limited to, trypsin cleavage) followed by mass spectrometry analysis. Pepinsky B., et al., J. Pharmcol. & Exp. Ther. 297(3):1059-66 (2001).

A water soluble polymer linked to an amino acid of an Fc protein of the invention can be further derivatized or substituted without limitation.

In another embodiment of the invention, an antigen-binding polypeptide is modified with a PEG derivative that contains an azide moiety that will react with an alkyne moiety present on the side chain of the non-naturally encoded amino acid. In general, the PEG derivatives will have an average molecular weight ranging from 1-100 kDa and, in some embodiments, from 10-40 kDa.

In some embodiments, the azide-terminal PEG derivative will have the structure: RO—(CH₂CH₂O)_(n)O—(CH₂)_(m)—N₃ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000 (i.e., average molecular weight is between 5-40 kDa).

In another embodiment, the azide-terminal PEG derivative will have the structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)_(m)—NH—C(O)—(CH₂)_(p)—N₃, where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10, p is 2-10 and n is 100-1,000 (i.e., average molecular weight is between 5-40 kDa).

In another embodiment of the invention, an Fc protein comprising a alkyne-containing amino acid is modified with a branched PEG derivative that contains a terminal azide moiety, with each chain of the branched PEG having a MW ranging from 10-40 kDa and, more preferably, from 5-20 kDa. For instance, in some embodiments, the azide-terminal PEG derivative will have the following structure: [RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)]₂CH(CH-2)_(m)—X—(CH₂)_(p)—N₃ where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10, p is 2-10, and n is 100-1,000, and X is optionally an O, N, S or carbonyl group (C═O), in each case that can be present or absent.

In another embodiment of the invention, an antigen-binding polypeptide is modified with a PEG derivative that contains an alkyne moiety that will react with an azide moiety present on the side chain of the non-naturally encoded amino acid.

In some embodiments, the alkyne-terminal PEG derivative will have the following structure: RO—(CH₂CH₂O)_(n)—O—(CH₂)_(m)—C═CH where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10 and n is 100-1,000 (i.e., average molecular weight is between 5-40 kDa).

In another embodiment of the invention, an Fc protein comprising an alkyne-containing non-naturally encoded amino acid is modified with a PEG derivative that contains a terminal azide or terminal alkyne moiety that is linked to the PEG backbone by means of an amide linkage.

In some embodiments, the alkyne-terminal PEG derivative will have the following structure: RO—(CH₂CH₂)_(n)—O—(CH₂)_(m)—NH—C(O)—(CH₂)_(p)—C═CH where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10, p is 2-10 and n is 100-1,000.

In another embodiment of the invention, an antigen-binding polypeptide comprising an azide-containing amino acid is modified with a branched PEG derivative that contains a terminal alkyne moiety, with each chain of the branched PEG having a MW ranging from 10-40 kDa and, more preferably, from 5-20 kDa. For instance, in some embodiments, the alkyne-terminal PEG derivative will have the following structure: [RO—(CH₂CH₂O)_(n)—O—(CH₂)₂—NH—C(O)]₂CH(CH₂)_(m)—X—(CH₂)_(p)C═CH where R is a simple alkyl (methyl, ethyl, propyl, etc.), m is 2-10, p is 2-10, and n is 100-1,000, and X is optionally an O, N, S or carbonyl group (C═O), or not present.

In another embodiment of the invention, an Fc protein is modified with a PEG derivative that contains an activated functional group (including but not limited to, ester, carbonate) further comprising an aryl phosphine group that will react with an azide moiety present on the side chain of the non-naturally encoded amino acid. In general, the PEG derivatives will have an average molecular weight ranging from 1-100 kDa and, in some embodiments, from 10-40 kDa.

Other exemplary PEG molecules that may be linked to Fc proteins, as well as PEGylation methods include those described in, e.g., U.S. Patent Publication Nos. 2004/0001838; 2002/0052009; 2003/0162949; 2004/0013637; 2003/0228274; 2003/0220447; 2003/0158333; 2003/0143596; 2003/0114647; 2003/0105275; 2003/0105224; 2003/0023023; 2002/0156047; 2002/0099133; 2002/0086939; 2002/0082345; 2002/0072573; 2002/0052430; 2002/0040076; 2002/0037949; 2002/0002250; 2001/0056171; 2001/0044526; 2001/0027217; 2001/0021763; U.S. Pat. Nos. 6,646,110; 5,824,778; 5,476,653; 5,219,564; 5,629,384; 5,736,625; 4,902,502; 5,281,698; 5,122,614; 5,473,034; 5,516,673; 5,382,657; 6,552,167; 6,610,281; 6,515,100; 6,461,603; 6,436,386; 6,214,966; 5,990,237; 5,900,461; 5,739,208; 5,672,662; 5,446,090; 5,808,096; 5,612,460; 5,324,844; 5,252,714; 6,420,339; 6,201,072; 6,451,346; 6,306,821; 5,559,213; 5,612,460; 5,747,646; 5,834,594; 5,849,860; 5,980,948; 6,004,573; 6,129,912; WO 97/32607, EP 229,108, EP 402,378, WO 92/16555, WO 94/04193, WO 94/14758, WO 94/17039, WO 94/18247, WO 94/28024, WO 95/00162, WO 95/11924, WO95/13090, WO 95/33490, WO 96/00080, WO 97/18832, WO 98/41562, WO 98/48837, WO 99/32134, WO 99/32139, WO 99/32140, WO 96/40791, WO 98/32466, WO 95/06058, EP 439 508, WO 97/03106, WO 96/21469, WO 95/13312, EP 921 131, WO 98/05363, EP 809 996, WO 96/41813, WO 96/07670, EP 605 963, EP 510 356, EP 400 472, EP 183 503 and EP 154 316, which are incorporated by reference herein. Any of the PEG molecules described herein may be used in any form, including but not limited to, single chain, branched chain, multiarm chain, single functional, bi-functional, multi-functional, or any combination thereof.

In certain embodiments, the Fc proteins can be linked to the payloads with one or more linkers capable of reacting with the non-natural amino acid. The one or more linkers can be any linkers apparent to those of skill in the art. The term “linker” is used herein to refer to groups or bonds that normally are formed as the result of a chemical reaction and typically are covalent linkages. Hydrolytically stable linkages means that the linkages are substantially stable in water and do not react with water at useful pH values, including but not limited to, under physiological conditions for an extended period of time, perhaps even indefinitely. Hydrolytically unstable or degradable linkages mean that the linkages are degradable in water or in aqueous solutions, including for example, blood. Enzymatically unstable or degradable linkages mean that the linkage can be degraded by one or more enzymes. As understood in the art, PEG and related polymers may include degradable linkages in the polymer backbone or in the linker group between the polymer backbone and one or more of the terminal functional groups of the polymer molecule. For example, ester linkages formed by the reaction of PEG carboxylic acids or activated PEG carboxylic acids with alcohol groups on a biologically active agent generally hydrolyze under physiological conditions to release the agent. Other hydrolytically degradable linkages include, but are not limited to, carbonate linkages; imine linkages resulted from reaction of an amine and an aldehyde; phosphate ester linkages formed by reacting an alcohol with a phosphate group; hydrazone linkages which are reaction product of a hydrazide and an aldehyde; acetal linkages that are the reaction product of an aldehyde and an alcohol; orthoester linkages that are the reaction product of a formate and an alcohol; peptide linkages formed by an amine group, including but not limited to, at an end of a polymer such as PEG, and a carboxyl group of a peptide; and oligonucleotide linkages formed by a phosphoramidite group, including but not limited to, at the end of a polymer, and a 5′ hydroxyl group of an oligonucleotide. Branched linkers may be used in Fc proteins of the invention. A number of different cleavable linkers are known to those of skill in the art. See U.S. Pat. Nos. 4,618,492; 4,542,225, and 4,625,014. The mechanisms for release of an agent from these linker groups include, for example, irradiation of a photolabile bond and acid-catalyzed hydrolysis. U.S. Pat. No. 4,671,958, for example, includes a description of immunoconjugates comprising linkers which are cleaved at the target site in vivo by the proteolytic enzymes of the patient's complement system. The length of the linker may be predetermined or selected depending upon a desired spatial relationship between the Fc protein and the molecule linked to it. In view of the large number of methods that have been reported for attaching a variety of radiodiagnostic compounds, radiotherapeutic compounds, drugs, toxins, and other agents to Fc proteins one skilled in the art will be able to determine a suitable method for attaching a given agent to an Fc protein.

Any hetero- or homo-bifunctional linker can be used to link the conjugates. The linker may have a wide range of molecular weight or molecular length. Larger or smaller molecular weight linkers may be used to provide a desired spatial relationship or conformation between the Fc protein and the linked entity. Linkers having longer or shorter molecular length may also be used to provide a desired space or flexibility between the Fc protein and the linked entity. Similarly, a linker having a particular shape or conformation may be utilized to impart a particular shape or conformation to the Fc protein or the linked entity, either before or after the Fc protein reaches its target. The functional groups present on each end of the linker may be selected to modulate the release of an Fc protein or a payload under desired conditions. This optimization of the spatial relationship between the Fc protein and the linked entity may provide new, modulated, or desired properties to the molecule.

In some embodiments, the invention provides water-soluble bifunctional linkers that have a dumbbell structure that includes: a) an azide, an alkyne, a hydrazine, a hydrazide, a hydroxylamine, or a carbonyl-containing moiety on at least a first end of a polymer backbone; and b) at least a second functional group on a second end of the polymer backbone. The second functional group can be the same or different as the first functional group. The second functional group, in some embodiments, is not reactive with the first functional group. The invention provides, in some embodiments, water-soluble compounds that comprise at least one arm of a branched molecular structure. For example, the branched molecular structure can be dendritic.

Fc Protein Compositions

Fc proteins and conjugates described herein can be formulated into compositions using methods available in the art and those disclosed herein. Any of the compounds disclosed herein can be provided in the appropriate pharmaceutical composition and be administered by a suitable route of administration.

In certain embodiments, the Fc protein or Fc conjugate compositions provided herein further comprise a pharmaceutically acceptable carrier. The carrier can be a diluent, excipient, or vehicle with which the pharmaceutical composition is administered. Such pharmaceutical carriers can be sterile liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid carriers, particularly for injectable solutions. Suitable pharmaceutical excipients include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, glycerol, propylene, glycol, water, ethanol and the like. The composition, if desired, can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. These compositions can take the form of solutions, suspensions, emulsion, tablets, pills, capsules, powders, sustained-release formulations and the like. Oral formulation can include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, etc. Examples of suitable pharmaceutical carriers are described in E.W. Martin, 1990, Remington's Pharmaceutical Sciences, Mack Publishing Co.

In some embodiments, the pharmaceutical composition is provided in a form suitable for administration to a human subject. In some embodiments, the pharmaceutical composition will contain a prophylactically or therapeutically effective amount of the Fc protein together with a suitable amount of carrier so as to provide the form for proper administration to the patient. The formulation should suit the mode of administration.

In some embodiments, the pharmaceutical composition is provided in a form suitable for intravenous administration. Typically, compositions suitable for intravenous administration are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a local anesthetic such as lignocaine to ease pain at the site of the injection. Such compositions, however, may be administered by a route other than intravenous administration.

In particular embodiments, the pharmaceutical composition is suitable for subcutaneous administration. In particular embodiments, the pharmaceutical composition is suitable for intramuscular administration.

Components of the pharmaceutical composition can be supplied either separately or mixed together in unit dosage form, for example, as a dry lyophilized powder or water free concentrate. Where the composition is to be administered by infusion, it can be dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where the composition is administered by injection, an ample of sterile water for injection or saline can be provided so that the ingredients may be mixed prior to administration.

In some embodiments, the pharmaceutical composition is supplied as a dry sterilized lyophilized powder that is capable of being reconstituted to the appropriate concentration for administration to a subject. In some embodiments, Fc proteins are supplied as a water free concentrate. In some embodiments, the Fc protein is supplied as a dry sterile lyophilized powder at a unit dosage of at least 0.5 mg, at least 1 mg, at least 2 mg, at least 3 mg, at least 5 mg, at least 10 mg, at least 15 mg, at least 25 mg, at least 30 mg, at least 35 mg, at least 45 mg, at least 50 mg, at least 60 mg, or at least 75 mg.

In another embodiment, the pharmaceutical composition is supplied in liquid form. In some embodiments, the pharmaceutical composition is provided in liquid form and is substantially free of surfactants and/or inorganic salts. In some embodiments, the Fc protein is supplied as in liquid form at a unit dosage of at least 0.1 mg/ml, at least 0.5 mg/ml, at least 1 mg/ml, at least 2.5 mg/ml, at least 3 mg/ml, at least 5 mg/ml, at least 8 mg/ml, at least 10 mg/ml, at least 15 mg/ml, at least 25 mg/ml, at least 30 mg/ml, or at least 60 mg/ml.

In some embodiments, the pharmaceutical composition is formulated as a salt form. Pharmaceutically acceptable salts include those formed with anions such as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed with cations such as those derived from sodium, potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, procaine, etc.

In therapeutic use, the practitioner will determine the posology most appropriate according to a preventive or curative treatment and according to the age, weight, stage of the infection and other factors specific to the subject to be treated. In certain embodiments, doses are from about 1 to about 1000 mg per day for an adult, or from about 5 to about 250 mg per day or from about 10 to 50 mg per day for an adult. In certain embodiments, doses are from about 5 to about 400 mg per day or 25 to 200 mg per day per adult. In certain embodiments, dose rates of from about 50 to about 500 mg per day are also contemplated.

Methods of Use for Therapy or Prophylaxis

Certain Fc proteins or Fc protein conjugates provided herein can be used for the treatment or prevention of any disease or condition deemed suitable to the practitioner of skill in the art. Generally, a method of treatment or prevention encompasses the administration of a therapeutically or prophylactically effective amount of the Fc protein or conjugate composition to a subject in need thereof to treat or prevent the disease or condition.

A therapeutically effective amount of the Fc protein or conjugate composition is an amount that is effective to reduce the severity, the duration and/or the symptoms of a particular disease or condition. The amount of the Fc protein or composition that will be therapeutically effective in the prevention, management, treatment and/or amelioration of a particular disease can be determined by standard clinical techniques. The precise amount of the Fc protein or composition to be administered with depend, in part, on the route of administration, the seriousness of the particular disease or condition, and should be decided according to the judgment of the practitioner and each subject's circumstances.

In some embodiments, the effective amount of the Fc protein or conjugate provided herein is between about 0.025 mg/kg and about 1000 mg/kg body weight of a human subject. In certain embodiments, the Fc protein is administered to a human subject at an amount of about 1000 mg/kg body weight or less, about 950 mg/kg body weight or less, about 900 mg/kg body weight or less, about 850 mg/kg body weight or less, about 800 mg/kg body weight or less, about 750 mg/kg body weight or less, about 700 mg/kg body weight or less, about 650 mg/kg body weight or less, about 600 mg/kg body weight or less, about 550 mg/kg body weight or less, about 500 mg/kg body weight or less, about 450 mg/kg body weight or less, about 400 mg/kg body weight or less, about 350 mg/kg body weight or less, about 300 mg/kg body weight or less, about 250 mg/kg body weight or less, about 200 mg/kg body weight or less, about 150 mg/kg body weight or less, about 100 mg/kg body weight or less, about 95 mg/kg body weight or less, about 90 mg/kg body weight or less, about 85 mg/kg body weight or less, about 80 mg/kg body weight or less, about 75 mg/kg body weight or less, about 70 mg/kg body weight or less, or about 65 mg/kg body weight or less.

In some embodiments, the effective amount of Fc protein or conjugate provided herein is between about 0.025 mg/kg and about 60 mg/kg body weight of a human subject. In some embodiments, the effective amount of an Fc protein of the pharmaceutical composition provided herein is about 0.025 mg/kg or less, about 0.05 mg/kg or less, about 0.10 mg/kg or less, about 0.20 mg/kg or less, about 0.40 mg/kg or less, about 0.80 mg/kg or less, about 1.0 mg/kg or less, about 1.5 mg/kg or less, about 3 mg/kg or less, about 5 mg/kg or less, about 10 mg/kg or less, about 15 mg/kg or less, about 20 mg/kg or less, about 25 mg/kg or less, about 30 mg/kg or less, about 35 mg/kg or less, about 40 mg/kg or less, about 45 mg/kg or less, about 50 mg/kg or about 60 mg/kg or less.

The pharmaceutical composition of the method can be administered using any method known to those skilled in the art. For example, the pharmaceutical composition can be administered intramuscularly, intradermally, intraperitoneally, intravenously, subcutaneously administration, or any combination thereof. In some embodiments, the pharmaceutical composition is administered subcutaneously. In some embodiments, the composition is administered intravenously. In some embodiments, the composition is administered intramuscularly.

Methods of Use for Detection or Diagnosis

The Fc proteins of Fc protein conjugates provided herein can be used for the detection of any target or for the diagnosis of any disease or condition deemed suitable to the practitioner of skill in the art. The methods encompass detecting the binding of an Fc protein to a target antigen in the appropriate location, e.g., the appropriate body, tissue, or cell. In the methods, the formation of a complex between the Fc protein and antigen can be detected by any method known to those of skill in the art. Examples include assays that use secondary reagents for detection, ELISA's and immunoprecipitation and agglutination assays. A detailed description of these assays is, for example, given in Harlow and Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory, New York 1988 555-612, WO96/13590 to Maertens and Stuyver, Zrein et al. (1998) and WO96/29605.

For in situ diagnosis, the Fc protein or conjugate may be administered to a subject by methods known in the art such as, for example, intravenous, intranasal, intraperitoneal, intracerebral, intraarterial injection such that a specific binding between an Fc protein according to the invention with an eptitopic region on the amyloid protein may occur. The Fc protein/antigen complex may conveniently be detected through a label attached to the Fc protein or any other art-known method of detection.

Further provided herein are kits for detection or diagnosis. Exemplary kits comprise one or more Fc proteins or conjugates provided herein along with one or more reagents useful for detecting a complex between the one or more Fc proteins and their target antigens.

Preparation of Fc Proteins

The Fc proteins described herein can be prepared by any technique apparent to those of skill in the art without limitation. Useful techniques for preparation include in vivo synthesis, for example with modified tRNA and tRNA synthetase, cell-free synthesis, for example with modified tRNA and tRNA synthetase, solid phase polypeptide synthesis and liquid phase polypeptide synthesis. Exemplary techniques are described in this section and in the examples below.

In certain methods, the Fc protein is translated and/or transcribed from one or more polynucleotides encoding the polypeptide chains of the Fc protein. Accordingly, provided herein are polynucleotides capable of encoding the Fc proteins having one or more non-natural amino acids at site-specific positions in one or more polypeptide chains. In certain embodiments, the polynucleotides comprise a codon not normally associated with an amino acid at the polynucleotide position corresponding to the site-specific polypeptide position for the non-natural amino acid. Examples of such codons include stop codons, 4 bp codons, 5 bp codons, and the like. The reaction mixture typically comprises a tRNA synthetase capable of making tRNAs that complement (suppress) corresponding codons. These suppressor tRNAs are linked to the non-natural amino acids to facilitate their incorporation into the polypeptide at the site of the suppressor codon.

The Fc proteins can be prepared by techniques known to those of skill in the art for expressing such polynucleotides to incorporate non-natural amino acids into site specific positions of a polypeptide chain. Such techniques are described, for example, in U.S. Pat. Nos. 7,045,337 and 7,083,970, in U.S. Published Patent Application Nos. US 2008/0317670, US 2009/0093405, US 2010/0093082, US 2010/0098630, US 2008/0085277 and in international patent publication nos. WO 2004/016778 A1 and WO 2008/066583 A2, the contents of which are hereby incorporated by reference in their entireties.

In certain embodiments, an Fc protein can be prepared in a cell-free reaction mixture comprising at least one orthogonal tRNA aminoacylated with an unnatural amino acid, where the orthogonal tRNA base pairs with a codon that is not normally associated with an amino acid, e.g. a stop codon; a 4 bp codon, etc. The reaction mixture also comprises a tRNA synthetase capable of aminoacylating the orthogonal tRNA with an unnatural amino acid. Usually the orthogonal tRNA synthetase, which is susceptible to degradation by proteases present in bacterial cell extracts, is exogenously synthesized and added to the reaction mix prior to initiation of polypeptide synthesis. The orthogonal tRNA may be synthesized in the bacterial cells from which the cell extract is obtained, may be synthesized de novo during the polypeptide synthesis reaction, or may be exogenously added to the reaction mix.

In certain embodiments, components that affect unnatural amino acid insertion and protein insertion or folding are optionally added to the reaction mixture. Such components include elevated concentrations of translation factors to minimize the effect of release factor 1 and 2 and to further optimize orthogonal component concentrations. Protein chaperones (Dsb System of oxidoreductases and isomerases, GroES, GroEL, DNAJ, DNAK, Skp, etc.) may be exogenously added to the reaction mixture or may be overexpressed in the source cells used to prepare the cell extract The reactions may utilize a large scale reactor, small scale, or may be multiplexed to perform a plurality of simultaneous syntheses. Continuous reactions will use a feed mechanism to introduce a flow of reagents, and may isolate the end-product as part of the process. Batch systems are also of interest, where additional reagents may be introduced to prolong the period of time for active synthesis. A reactor may be run in any mode such as batch, extended batch, semi-batch, semi-continuous, fed-batch and continuous, and which will be selected in accordance with the application purpose. The reactions may be of any volume, either in a small scale, usually at least about 1 μl and not more than about 15 μl, or in a scaled up reaction, where the reaction volume is at least about 15 μl, usually at least about 50 μl, more usually at least about 100 μl, and may be 500 μl, 1000 μl, or greater. In principle, reactions may be conducted at any scale as long as sufficient oxygen (or other electron acceptor) is supplied when needed.

Useful methods for synthesis where at least one unnatural amino acid is introduced into the polypeptide strand during elongation include but are not limited to: (I) addition of exogenous purified orthogonal synthetase, unnatural amino acid, and orthogonal tRNA to the cell-free reaction, (II) addition of exogenous purified orthogonal synthetase and unnatural amino acid to the reaction mixture, but with orthogonal tRNA transcribed during the cell-free reaction, (III) addition of exogenous purified orthogonal synthetase and unnatural amino acid to the reaction mixture, but with orthogonal tRNA synthesized by the cell extract source organism. In certain embodiments, the orthogonal components are driven by regulatable promoters, so that synthesis levels can be controlled although other measures may be used such as controlling the level of the relevant DNA templates by addition or specific digestion.

In some embodiments, a bacterial cell-free expression system is used to produce protein or peptide variants with non-native amino acids (nnAA). The use of bacterial cell-free extracts for in vitro protein synthesis offers several advantages over conventional in vivo protein expression methods. Cell-free systems can direct most, if not all, of the metabolic resources of the cell towards the exclusive production of one protein. Moreover, the lack of a cell wall and membrane components in vitro is advantageous since it allows for control of the synthesis environment. However, the efficiency of cell-free extracts can be decreased by bacterial proteins that inhibit protein synthesis, either directly or indirectly. Thus, inactivation of undesirable proteins that decrease the efficiency of protein synthesis should increase the yield of desirable proteins in cell-free extracts. For example, the inactivation of proteins that decrease the efficiency of protein synthesis should increase the yield of polypeptides having non-native amino acids incorporated at a defined amino acid residue. The introduction of nnAA into polypeptides is useful for increasing the biological diversity and function of proteins. One approach for producing polypeptides having a nnAA incoroporated at a defined amino acid residue is to use an nnAA, aminoacylated orthogonal CUA containing tRNA for introduction of the nnAA into the nascent polypeptide at an amber (stop) codon during protein translation. However, the incorporation of nnAA at an amber codon can be inhibited by the native bacterial termination complex, which normally recognizes the stop codon and terminates translation. Release Factor 1 (RF1) is a termination complex protein that facilitates the termination of translation by recognizing the amber codon in an mRNA sequence. RF1 recognition of the amber stop codon can promote pre-mature truncation products at the site of non-native amino acid incorporation, and thus decreased protein yield. Therefore, attenuating the activity of RF1 may increase nnAA incorporation into recombinant proteins.

It has previously been shown that nnAA incorporation can be increased by attenuating RF1 activity in 3 ways: 1) neutralizing antibody inactivation of RF1, 2) genomic knockout of RF1 (in an RF2 bolstered strain), and 3) site specific removal of RF1 using a strain engineered to express RF1 containing a protein tag for removal by affinity chromatography (Chitin Binding Domain and His Tag). Another method for inactivating RF1 comprises introducing proteolytic cleavage sites into the RF1 amino acid sequence. The cleavage sites are not accessible to the protease during bacterial cell growth, but are cleaved by the protease when the bacterial cells are lysed to produce cell-free extract. Thus, the yield of full length polypeptides having a nnAA incorporated at an amber codon is increased in bacterial cell extracts expressing such modified RF1 variants.

In some embodiments, in order to produce Fc proteins comprising a non-natural amino acid, one needs a nucleic acid template. The templates for cell-free protein synthesis can be either mRNA or DNA. The template can comprise sequences for any particular antibody of interest, and may encode a full-length antibody or a fragment of any length thereof. Nucleic acids that serve as protein synthesis templates are optionally derived from a natural source or they can be synthetic or recombinant. For example, DNAs can be recombinant DNAs, e.g., plasmids, viruses or the like.

In some embodiments, once a nucleic acid template of an Fc protein is produced, the template is used to synthesize the antibody in a cell-free translation system. For example, the template can be added to a cell lysate under conditions sufficient to translate the template into protein. The cell lysate can be from bacterial cells or eukaryotic cells. The expressed Fc protein can then be purified using methods known in the art, as described below.

In some embodiments, a translation system (e.g., an in vitro protein synthesis system) is used to produce the Fc protein with one or more nnAAs incorporated therein. An exemplary translation system comprises a cell free extract, cell lysate, or reconstituted translation system, along with the nucleic acid template for synthesis of the desired polypeptide or protein having non-native amino acids at preselected (defined) positions. The reaction mixture will further comprise monomers for the macromolecule to be synthesized, e.g., amino acids, nucleotides, etc., and such co-factors, enzymes and other reagents that are necessary for the synthesis, e.g., ribosomes, tRNA, polymerases, transcriptional factors, etc. In addition to the above components such as a cell-free extract, nucleic acid template, and amino acids, materials specifically required for protein synthesis may be added to the reaction. The materials include salts, folinic acid, cyclic AMP, inhibitors for protein or nucleic acid degrading enzymes, inhibitors or regulators of protein synthesis, adjusters of oxidation/reduction potentials, non-denaturing surfactants, buffer components, spermine, spermidine, putrescine, etc. Various cell-free synthesis reaction systems are well known in the art. See, e.g., Kim, D. M. and Swartz, J. R. Biotechnol. Bioeng. 66:180-8 (1999); Kim, D. M. and Swartz, J. R. Biotechnol. Prog. 16:385-90 (2000); Kim, D. M. and Swartz, J. R. Biotechnol. Bioeng. 74:309-16 (2001); Swartz et al, Methods Mol. Biol. 267:169-82 (2004); Kim, D. M. and Swartz, J. R. Biotechnol. Bioeng. 85:122-29 (2004); Jewett, M. C. and Swartz, J. R., Biotechnol. Bioeng. 86:19-26 (2004); Yin, G. and Swartz, J. R., Biotechnol. Bioeng. 86:188-95 (2004); Jewett, M. C. and Swartz, J. R., Biotechnol. Bioeng. 87:465-72 (2004); Voloshin, A. M. and Swartz, J. R., Biotechnol. Bioeng. 91:516-21 (2005). Additional conditions for the cell-free synthesis of desired polypeptides are described in WO2010/081110, the contents of which are incorporated by reference herein in its entirety.

In some embodiments, a DNA template is used to drive in vitro protein synthesis, and RNA polymerase is added to the reaction mixture to provide enhanced transcription of the DNA template. RNA polymerases suitable for use herein include any RNA polymerase that functions in the bacteria from which the bacterial extract is derived. In other embodiments, an RNA template is used to drive in vitro protein synthesis, and the components of the reaction mixture can be admixed together in any convenient order, but are preferably admixed in an order wherein the RNA template is added last, thereby minimizing potential degradation of the RNA template by nucleases.

In some embodiments, a cell-free translation system is used to produce the antibody with one or more nnAAs incorporated therein. Cell-free protein synthesis exploits the catalytic power of the cellular machinery. Obtaining maximum protein yields in vitro requires adequate substrate supply, e.g., nucleoside triphosphates and amino acids, a homeostatic environment, catalyst stability, and the removal or avoidance of inhibitory byproducts. The optimization of in vitro synthetic reactions benefits from recreating the in vivo state of a rapidly growing organism. In some embodiments of the invention, cell-free synthesis is therefore performed in a reaction where oxidative phosphorylation is activated. Additional details are described in U.S. Pat. No. 7,338,789, the contents of which are incorporated by reference herein in its entirety.

In vitro, or cell-free, protein synthesis offers several advantages over conventional in vivo protein expression methods. Cell-free systems can direct most, if not all, of the metabolic resources of the cell towards the exclusive production of one protein. Moreover, the lack of a cell wall and membrane components in vitro is advantageous since it allows for control of the synthesis environment. For example, tRNA levels can be changed to reflect the codon usage of genes being expressed. The redox potential, pH, or ionic strength can also be altered with greater flexibility than with in vivo protein synthesis because concerns of cell growth or viability do not exist. Furthermore, direct recovery of purified, properly folded protein products can be easily achieved. In some embodiments, the productivity of cell-free systems has improved over 2-orders of magnitude in recent years, from about 5 μg/ml-hr to about 500 μg/ml-hr.

In certain embodiments, tRNA synthetase is exogenously synthesized and added to the cell-free reaction mix. In certain embodiments, the reaction mix is prepared from bacterial cells in which ompT has been inactivated or is naturally inactive. OmpT is believed to degrade components of the reaction mixture including tRNA synthetase.

In addition to the above components such as cell-free extract, genetic template, and amino acids, materials specifically required for protein synthesis may be added to the reaction. These materials include salts, folinic acid, cyclic AMP, inhibitors for protein or nucleic acid degrading enzymes, inhibitors or regulators of protein synthesis, adjusters of oxidation/reduction potential(s), non-denaturing surfactants, buffer components, spermine, spermidine, putrescine, etc.

The salts preferably include potassium, magnesium, and ammonium salts (e.g. of acetic acid or glutamic acid). One or more of such salts may have an alternative amino acid as a counter anion. There is an interdependence among ionic species for optimal concentration. These ionic species are typically optimized with regard to protein production. When changing the concentration of a particular component of the reaction medium, that of another component may be changed accordingly. For example, the concentrations of several components such as nucleotides and energy source compounds may be simultaneously adjusted in accordance with the change in those of other components. Also, the concentration levels of components in the reactor may be varied over time. The adjuster of oxidation/reduction potential may be dithiothreitol, ascorbic acid, glutathione and/or their oxidized forms.

In certain embodiments, the reaction can proceed in a dialysis mode, in a diafiltration batch mode, in a fed-batch mode of in a semi-continuous operation mode. In certain embodiments, a feed solution can be supplied to the reactor through a membrane or through an injection unit. Synthesized Fc protein can accumulate in the reactor followed by isolation or purification after completion of the system operation. Vesicles containing the Fc protein may also be continuously isolated, for example by affinity adsorption from the reaction mixture either in situ or in a circulation loop as the reaction fluid is pumped past the adsorption matrix.

During protein synthesis in the reactor, the protein isolating means for selectively isolating the desired protein may include a unit packed with particles coated with Fc protein molecules or other molecules for adsorbing the synthesized, desired protein. Preferably, the protein isolating means comprises two columns for alternating use.

The resulting Fc protein can be purified or isolated by standard techniques. Exemplary techniques are provided in the examples below.

Assay Methods

Fc proteins can be assayed for their expected activity, or for a new activity, according to any assay apparent to those of skill in the art. The resulting Fc protein can be assayed activity in a functional assay, e.g. binding to Fc receptor, or by quantitating the amount of protein present in a non-functional assay, e.g. immunostaining, ELISA, quantitation on Coomasie or silver stained gel, etc., and determining the ratio of biologically active protein to total protein.

The amount of protein produced in a translation reaction can be measured in various fashions. One method relies on the availability of an assay which measures the activity of the particular protein being translated. An example of an assay for measuring protein activity is a luciferase assay system, or chloramphenical acetyl transferase assay system. These assays measure the amount of functionally active protein produced from the translation reaction. Activity assays will not measure full length protein that is inactive due to improper protein folding or lack of other post translational modifications necessary for protein activity.

Another method of measuring the amount of protein produced in coupled in vitro transcription and translation reactions is to perform the reactions using a known quantity of radiolabeled amino acid such as ³⁵S-methionine, ³H-leucine or ¹⁴C-leucine and subsequently measuring the amount of radiolabeled amino acid incorporated into the newly translated protein. Incorporation assays will measure the amount of radiolabeled amino acids in all proteins produced in an in vitro translation reaction including truncated protein products. The radiolabeled protein may be further separated on a protein gel, and by autoradiography confirmed that the product is the proper size and that secondary protein products have not been produced.

EXAMPLES

As used herein, the symbols and conventions used in these processes, schemes and examples, regardless of whether a particular abbreviation is specifically defined, are consistent with those used in the contemporary scientific literature, for example, the Journal of Biological Chemistry.

For all of the following examples, standard work-up and purification methods known to those skilled in the art can be utilized. Unless otherwise indicated, all temperatures are expressed in ° C. (degrees Centigrade). All methods are conducted at room temperature unless otherwise noted. The follow examples set exemplary Fc-containing molecules containing non-natural amino acids that are in the context of full-length antibody sequences. It is believed that the N-terminal heavy chain sequences and light chain sequences do not significantly affect the suitability of the particular sites for incorporation of a non-natural amino acid and/or conjugation to that non-natural amino acid. Thus, sites that have desirable properties in the context of a full-length antibody also have desirable properties in the context of an Fc protein.

Example 1 Synthesis of Exemplary Antibodies Containing Non-Natural Amino Acids and Conjugation to an Exemplary Cytotoxic Agent

The following example describes exemplary antibodies containing non-natural amino acids at defined positions along with methods of their construction and expression. The antibodies were assessed for expression, suppression efficiency, efficiency of conjugation with a cell-killing agent, cell binding, and cell killing, as set forth below. In this example, the exemplary antibodies are based on parent antibody trastuzumab (U.S. Pat. No. 6,165,464 and 2006/0018899 A1; Carter et al., 1992, Proc. Natl. Acad. Sci. USA 10:4285-4289).

First, constructs were made incorporating a TAG amber codon at a defined position in the heavy chain of trastuzumab. Site directed mutagenesis was performed using a pYD plasmid containing the coding region of trastuzumab with a C-terminal-(His)₆ tag (SD02005) as DNA template and synthetic oligonucleotides (Eurofins MWS Operon; Huntsville, Ala.) containing mutations of interest in both sense and antisense directions. Oligonucleotides of each mutation were added to the DNA template and PHUSION® polymerase (New England Biolabs; Ipswich, Mass.) to a final volume of 20 μL. The final concentration of each component was 0.16 μM of each oligonucleotide, 0.5 ng/μL template DNA, 0.02 U/μL PHUSION® polymerase in HF buffer containing 1.5 mM MgCl₂ and 200 μM dNTP. Mixture was incubated at 98° C. 5 m, 18 PCR cycles (98° C. 30 s, 55° C. 1 min, 72° C. 4 min), 10 min at 72° C. and stored at 4° C. DpnI (New England Biolabs; Ipswich, Mass.) was added to the mixture to final concentration of 0.6 U/μL and incubated for 37° C. 1 h to digest parent DNA.

5 μL of each mixture was then transformed into 50 μL of chemically competent E. coli cells with a MultiShot™ 96-Well Plate TOP10 according to the manufacturer's procedure (Invitrogen; Carlsbad, Calif.). Transformed cells were recovered in 200 μL SOC (Invitrogen; Carlsbad, Calif.) at 37° C. for 1 hr and plated onto Luria-Bertani (LB) agar supplemented with 50 μg/mL kanamycin (Teknova; Hollister, Calif.). After 24 hrs at 37° C., colonies were picked into 200 μL LB with 7.5% glycerol and 50 μg/mL kanamycin, and grown at 37° C. for 24 hrs. 20 μL of culture was used for rolling circle amplification (RCA) and sequenced by primer extension using T7 (SEQ ID NO: 2, 5′TAATACGACTCACTATAGG 3′) and T7 term (SEQ ID NO: 3, 5′GCTAGTTATTGCTCAGCG3′) primers. Sequence was analyzed using SEQUENCHER® software (Gene Codes Corp; Ann Arbor, Mich.), and clones containing mutations were picked and arrayed into 96 well plates. Overnight cultures of selected variants were grown and used for preparing plasmid DNA using the standard 96 well mini-prep protocol according to the manufacturer (Qiagen; Germantown, Md.). Concentration of mini-prepped DNA was measured using absorbance at 260 nm (and 280 nm).

The variants were then expressed in a cell-free protein synthesis reaction as follows as described in Zawada et al., 2011, Biotechnol. Bioeng. 108(7)1570-1578 with the modifications described below. Unsubstituted trastuzumab was also made as a control. Cell-free extracts were treated with 50 μM iodoacetamide for 30 min at RT (20° C.) and added to a premix containing all other components except for heavy chain DNA from variants of interest. Cell free reactions were initiated by addition of plasmid DNA of heavy chain DNA variants and incubated at 30° C. for 12 h on a shaker at 450 rpm in 96 deep well plates. The reaction was incubated further at 4° C. for 5 h. The final concentration in the protein synthesis reaction was 30% cell extract, 1 mM para-azido phenylalanine (pAzF) (RSP Amino Acids), 0.125 mg/mL M. jannaschii amber suppressor tRNA, 0.37 mg/mL M. jannaschii pAzF-specific amino-acyl tRNA synthetase (FRS), 2 mM GSSG, 0.29 mg/mL PDI (Mclab), 100 μg/mL E. coli DsbC, 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 35 mM sodium pyruvate, 1.2 mM AMP, 0.86 mM each of GMP, UMP, and CMP, 2 mM amino acids (except 0.5 mM for Tyrosine and Phenylalanine), 4 mM sodium oxalate, 1 mM putrescine, 1.5 mM spermidine, 15 mM potassium phosphate, 100 nM T7 RNAP, 2 μg/mL trastuzumab light chain DNA, 8 μg/mL trastuzumab-(His)6 heavy chain DNA. Each trastuzamab variant was produced in 1 mL scale in 96 deep well plates in duplicates. A total of three plates were used to express the variants, each compared to expression of an unsubstituted trastuzumab to normalize for expression across the plates. It should be noted that all trastuzumab variants thus produced were aglycosylated.

To monitor protein synthesis, a portion of each cell-free protein synthesis reaction was removed and spiked with 3.33% (v/v) 1-[U-14C]-leucine (300 mCi/mmole; GE Life Sciences, Piscataway, N.J.). The suppression of amber codon at different sites of the heavy chain was determined by [¹⁴C]-autoradiograhy of reducing SDS-PAGE gels. Full length trastuzumab heavy chain and suppressed tastuzumab heavy chain variants run at 50 kD on SDS-PAGE. Non suppressed (truncated) trastuzumab variants run at a lower molecular weight. Amber suppression in the heavy chain is determined by:

${suppression} = \frac{{band}\mspace{14mu} {intensity}\mspace{14mu} {of}\mspace{14mu} {suppressed}\mspace{14mu} {heavy}\mspace{14mu} {chain}\mspace{14mu} {TAG}\mspace{14mu} {variant}}{{band}\mspace{14mu} {intensity}\mspace{14mu} {of}\mspace{14mu} {wild}\mspace{14mu} {type}\mspace{14mu} {heavy}\mspace{14mu} {chain}}$

Band intensity was determined by ImageQuant (Amersham Biosciences Corp.; Piscataway, N.J.). It should be noted that this 14C gel assay cannot be used to assess suppressions for TAG variants of heavy chain from the position of 425 (EU index numbering) to C terminus since the truncated variants cannot be distinguished from the full length product.

Following synthesis, each 1 mL reaction was diluted with 1 mL PBS at pH7.4. The mixture was centrifuged at 5000×g 4° C. for 15 min. Supernatant was captured with IMAC Phytip containing 40 μL resin (PhyNexus, Inc.; San Jose, Calif.) by pipetting up and down 4 times slowly at a flow rate of 4.2 l/min. Resin was washed with 925 μL IMAC binding buffer (50 mM Tris pH 8.0, 300 mM NaCl, 10 mM imidazole) by pipetting up and down twice at a flow rate of 8.3 μL/min. This process was repeated once with an additional 925 μL IMAC binding buffer. Bound protein was eluted with 250 μL IMAC elution buffer (50 mM Tris pH 8.0, 300 mM NaCl, 500 mM Imidazole) by pipetting up and down 4 times at a flow rate of 4.2 μL/min. As the (his)6 tag of the construct is C-terminal, this procedure allows isolation of full length protein. To quantitate the variants following purification, IMAC purified samples were mixed with 2× gel loading buffer (Bio-Rad#161-0737) resolved by of 4-15% Stain-Free™ gel (Bio-Rad Criterion™ TGX #567-8085). Samples were not heated before loading into wells of the gel. Protein bands were visualized and quantitated by Bio-Rad Gel Doc EZ System using Image Lab software (v3). Band intensities of samples were determined based on mass standards of HERCEPTIN® loaded on the same gel.

Next, the trastuzumab variants were conjugated to an exemplary cytotoxic agent, MMAF, using a constrained cyclooctyne reagent. In brief, DBCO-MMAF (structure shown as FIG. 1; ACME Bioscience; Palo Alto, Calif.) was dissolved in DMSO to a final concentration of 5 mM. The compound was diluted into with PBS to 1 mM and then added to trastuzumab-(His)₆ variants in IMAC elution buffer to final drug concentration of 100 μM. Mixture was incubated at RT (20° C.) for 17 hours. Reaction was stopped by adding Sodium Azide to final concentration of 100 mM and buffer exchanged using Zeba plates (Thermo Scientific; Waltham, Mass.) equilibrated in 1×PBS. Filtrate was then passed through a MUSTANG® Q plate (Pall Corp.; Port Washington, N.Y.) to remove endotoxin.

Example 2 Characterization of Exemplary Antibody-Drug Conjugates

Hydrophobic Interaction Chromatography (HIC) was performed to quantitate the samples and to determine the drug-antibody ratios of the trastuzumab variant drug conjugates as follows. Samples and standards were diluted 1:2 in 3M Ammonium Sulfate (EMD Chemical), 50 mM Sodium Phosphate pH 7.0 (Mallinckrodt) prepared in MilliQ water. An Agilent 1100 binary pump HPLC system was equipped with a Tosoh Bioscience LLC TSK-gel Butyl-NPR® (4.6 mm×3.5 cm) column with a column compartment temperature of 30° C. The mobile phase A was 1.5M Ammonium Sulfate, 50 mM Sodium Phosphate, pH 7.0. The mobile phase B was 50 mM Sodium Phosphate, pH 7.0 in 80:20 water:Isopropyl Alcohol (Honeywell). The mobile phase was delivered at a flow rate of 1.0 mL/minute. The separation was performed with a linear gradient of 15% mobile phase B to 100% mobile phase B in 10 minutes. The UV data was acquired at both 210 and 280 nm. The peak areas were quantitated using Chemstation software (Agilent) and the Drug Antibody Ratio (DAR) was calculated from the percent of the total peak area.

To assess binding of the conjugated trastuzumab variants, a binding assay to cells expressing HER2 was performed as follows. The binding of the purified conjugated variants to HER2 on SKBR3 cells, which overexpress the HER2/c-erb-2 gene product, with over 1.5 million receptor copies per cell (ATCC # HTB-30, Manassas, Va.) was compared to clinical grade Herceptin®, unglycosylated trastuzumab produced by cell-free protein synthesis, or human serum IgG1 as a negative control (Sigma-Aldrich; St. Louis, Mo.). SKBR3 cells were cultured in DMEM:Ham's F-12 (50:50), high glucose (Cellgro-Mediatech; Manassas, Va.) supplemented with 10% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× Pencillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). Adherent cells were washed twice with calcium and magnesium-free Hanks Balanced Salt Solution (HBSS), harvested with HYQ® TASE™ (Hyclone; Thermo Scientific; Waltham, Mass.). A total of 200,000 cells per sample in total volume of 10 μL were incubated with serial dilutions of either conjugated trastuzumab variant, clinical grade HERCEPTIN®, or aglycosylated trastuzumab made in 10 μL FACS buffer (DPBS buffer supplemented with 1% bovine serum albumin). Cells plus antibody or ADC were incubated for 60 minutes on ice. Unstained cells, human IgG1 (Isotype control) and Secondary antibody (goat anti-human IgG) were used as controls. Cells were washed twice with ice-cold FACS buffer and incubated with either 5 μg/ml Alexa 647 labeled goat anti-human IgG secondary antibody (Invitrogen; Carlsbad, Calif.) on ice for 1 hour. All samples were washed using FACS buffer and analyzed using a BD FACS Calibur system (BD Biosciences; San Jose, Calif.).

Mean fluorescence intensities were fitted using non-linear regression analysis with one site specific binding equation using GraphPad Prism (GraphPad v 5.00, Software; San Diego, Calif.). Data was expressed as Relative MFI vs. concentration of antibody or antibody variant in g/ml.

Next, the effects of the conjugated trastuzumab on cell killing were measured with a cell proliferation assay as follows. SKBR3 and MDA-MB-468 were obtained from ATCC and maintained in DMEM:Ham's F-12 (50:50), high glucose (Cellgro-Mediatech; Manassas, Va.) supplemented with 10% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× Pencillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). Adherent cells were washed twice with calcium and magnesium-free Hanks Balanced Salt Solution (HBSS), harvested with HYQ® TASE™ (Hyclone; Thermo Scientific; Waltham, Mass.). A total of 10³ cells were seeded in a volume of 40 μl in a 96-well half area flat bottom white Polystyrene plate. The cells were allowed to adhere overnight at 37° C. in a CO₂ incubator. ADC variants were formulated at 2× concentration in DMEM/F12 medium and filtered through MultiScreen_(HTS) 96-Well Filter Plates (Millipore; Billerica, Mass.). Filter sterilized conjugated trastuzumab variants, HERCEPTIN®, or aglycoslyated trastuzumab were added into treatment wells and plates were cultured at 37° C. in a CO2 incubator for 120 hrs. For cell viability measurement, 80 μl of Cell Titer-Glo® reagent (Promega Corp.; Madison, Wis.) was added into each well, and plates processed as per product instructions. Relative luminescence was measured on an ENVISION® plate reader (Perkin-Elmer; Waltham, Mass.). Relative luminescence readings were converted to % viability using untreated cells as controls. Data was fitted with non-linear regression analysis, using log(inhibitor) vs. response-Variable slope, 4-parameter fit equation using GraphPad Prism (GraphPad v 5.00, Software; San Diego, Calif.). Data was expressed as relative cell viability, ATP content % vs. dose of ADC in μg/ml.

Positive results of these characterization experiments are presented in Tables 1, 2, and 3 (plates 1, 2, and 3, respectively). In the Tables, the following properties of the variants are shown: the rank order and concentration of protein made, the efficiency of expression of full length product compared to wild-type (suppression efficiency), ability to bind HER2-expressing cells, drug-antibody ratio (expressed as the number of cytotoxic agents per antibody), a comment regarding the HIC profile, and the observed IC₅₀ from the cell-killing assay described above. In the comments regarding the HIC assay, Single Peak corresponds to a profile that resolved well as a single peak (an example of such a profile is shown as FIG. 2), NWR corresponds to a poorly resolved profile (an example of such a profile is shown as FIG. 3), and WR corresponds to a well-resolved HIC profile (an example of such a profile is shown as FIG. 4). Several of the variants that exhibited a single peak profile also exhibited potent killing, which indicates that the variant was indeed conjugated with drug, but failed to resolve on the HIC column.

In general, preferred variants exhibit an IC₅₀ of about 20 ng/ml or below, even more preferably below about 10 ng/ml, and even more preferably below about 5 ng/ml. Preferably, the variants are in the top 50% of expression, and even more preferably, are in the top 25% of expression. Variants that have a DAR of at least about 1.0 are also preferred. Preferred variants include those with a non-natural amino acid replacing positions P238, S239, F241, F243, K246, E356, M358, K360, V262, V264, D265, S267, H268, E269, D270, P271, E272, K274, F275, Y278, D280, G281, V282, E283, N286, T289, R292, E293, E294, Q295, Y296, N297, S298, T299, Y300, R301, V303, V305, K317, K320, S324, K326, A327, P329, A330, 1332, E333, K334, T335, S337, A339, Q342, R344, R355, T359, L398, S375, S383, N384, Q386, N389, K392, F404, G420, K340, Q438, N421, and Y₄₃₆, where the letter denotes a specific amino acid and the number denotes the position of the particular amino acid. Even more preferred were variants including those with a non-natural amino acid replacing positions S239, F241, K246, S267, H268, E269, D270, P271, E272, K274, F275, D280, G281, V282, E283, N286, T289, R292, E293, E294, Q295, Y296, N297, S298, T299, Y300, R301, V303, V305, K317, K320, S324, K326, A327, P329, A330, 1332, E333, K334, T335, S337, A339, Q342, R344, R355, T359, L398, S375, Q386, N389, K392, F404, G420, K340, Q438, and N421. Still more preferred were variants including those with a non-natural amino acid replacing positions S239, F241, S267, E269, D270, P271, E272, V282, N286, R292, E293, Y296, S298, P329, A330, K334, T335, K340, Q342, R355, T359, Q386, N389, F404, G420, N421, and Q438. Variants with a non-natural amino acid replacing positions S239, E293, K334, Q342, R355, T359, and N389 were identified as particularly preferred.

It was also noted that the expression, suppression efficiency, conjugation efficiency, cell-binding, and cell killing were not predictable based on the published crystal structure of Fc. For example, E293 and K334 would not be predicted to accept substitution with an aromatic phenylalanine derivative based on their positions within the Fc structure, yet both express well and are potent killers, with IC₅₀ values of 2.7 and 3.0 ng/ml, respectively. Conversely, S131 was predicted to be an excellent position for conjugation based on its availability on the surface of the heavy chain, yet derivatives with substitutions in this position were poor conjugators (DAR of 0.65) and poor killers (IC₅₀ of 27.1 ng/mL). Thus, it was concluded that experimentation was required to identify optimal sites for expression, suppression, conjugation, and cell killing.

It should also be noted that sites near the N-linked glycosylation site of trastuzumab, N297, were found to be well-suited for conjugation. In particular, sites from R292 through R301, V303 and V305 were found to exhibit desirable cell-killing and/or conjugation properties. It was possible to assess these sites because of the aglycosylated form of the trastuzumab variants produced in the cell-free protein synthesis reaction as described above.

Example 3 Fidelity of Incorporation of a Non-Natural Amino Acid in an Exemplary Antibody-Drug Conjugate

This Example describes an evaluation of the fidelity of incorporation of a non-natural amino acid in an exemplary antibody-drug conjugate. Trastuzumab substituted with p-azido-phenylalanine substituted at S136 conjugated with DBCO-MMAF was digested with trypsin and analyzed on an Agilent 6520 Accurate Mass Q-TOF LC-MS system equipped with a nano-electrospray ChipCube source. Peptides containing potential misincorporated amino acids and their wild-type counter parts were searched using extracted ion chromatograms from the LC-MS data within ±10 ppm of the theoretical m/z for the z=1-4 charge states of the peptide. If the m/z of the found peak were within ±10 ppm of the theoretical m/z and the correct charge state they were considered to be potential matches until MS/MS verification. In the absence of signal from misincorporation, the signal was assumed to be equal or less than the noise in the mass spectrum. The minimum fidelity rate was calculated as a noise to signal ratio. The signal was measured as the average of all monoisotopic ions across the peptide's elution profile for the most abundant charge state and the noise was estimated in the same spectrum in a region adjacent to the exhibiting a minimum amount of signal. Where an observed peptide contained a misincorporation event was found, its averaged signal was used in place of the noise measurement.

In the peptide map of the tested antibody-drug conjugate, peptide with nnAA+conjugated drug was detected at charge states 2 and 3 with an error of less than 4 ppm. At position 136, no tyrosine incorporation was present and the incorporation efficacy for pAzPhe was determined to be 98.3%. No tyrosine miss-incorporation was detected at the Phe positions across the detected IgG sequence. The overall, incorporation efficiency for Phe was measured to be at minimum 99.7%.

TABLE 1 IC₅₀ IMAC ¹⁴C CFPS SKBR3 gel expression ¹⁴C Suppression Cell HIC cell killing Variant μg/mL rank order μg/mL efficiency Binding DAR Profile (ng/ml) S136 151 77 35% + 1.54 WR 7.9 S239 161 2 126 57% + ND Single 5.2 Peak A118 156 7 105 48% + 1.51 WR 6.5 K246 162 18 75 34% + ND Single 6.6 Peak S119 191 6 109 50% + 1.41 7.4 S132 307 1 158 72% + 1.58 WR 8 A162 145 17 77 35% + 1.38 NWR 8 V005 186 3 124 56% + 0.99 NWR 11.5 S191 183 4 114 52% + 0.68 WR 13.9 S074 177 9 96 44% + 0.78 NWR 9.9 T135 151 8 96 44% + 1.03 NWR 10.1 A084 150 5 114 52% + 1.28 WR 9 G194 145 11 89 40% + 1.1 WR 10.8 T139 138 24 55 25% + 1 WR 9.6 A172 130 21 64 29% + 1.3 NWR 12.5 S134 124 16 80 36% + 1.37 WR 8.9 G137 112 15 83 38% + 1.23 WR 9.3 A023 110 19 70 32% + 1.33 NWR 9.8 S165 103 20 68 31% + 1.04 NWR 9.5 F241 99 13 87 39% + ND Single 8.4 Peak S160 96 14 86 39% + 1.34 WR 10 P238 93 22 60 27% + 0.47 NWR 12.9 T155 89 10 95 43% + 1.19 WR 10.5 V264 88 32 29 13% + ND Single 10.5 Peak S176 74 31 38 17% + 0.82 NWR 12.4 G138 72 29 43 20% + 1.16 WR 11.2 G065 68 30 40 18% + ND NWR 9.6 G042 68 26 49 22% + 1.12 NWR 13.8 F243 67 22 60 27% + 0.12 NWR 10.4 V262 47 36 18  8% shifted rt ND NWR 8.2 D265 56 38 13  6% + ND Single 9 Peak L174 52 28 43 20% + 1.07 NWR 12.8 S219 26 27 45 20% + 1.18 NWR 13.6 S131 76 25 49 22% 0.65 27.1 G161 43 33 26 12% 0.97 NWR 87.7 T164 17 35 22 10% 0.68 NWR 17.5 T195 26 39 14  6% 1.1 NWR S177 23 34 24 11% + 0.68 NWR 13.6

TABLE 2 IMAC ¹⁴C CFPS IC₅₀ Cell Gel expression ¹⁴C Suppression HIC Cell Killing Variant μg/mL rank order μg/mL Efficiency Binding DAR Profile (ng/ml) E293 147.6 9 50.7 19% + 1.5 WR 2.7 K334 135.7 3 72.5 28% + 0.12 Single 3 Peak E269 91.2 24 25.3 10% + 1.21 WR 1.4 S298 76.9 34 10.6  4% + 1.17 NWR 1.9 Single R292 160.9 7 54.8 21% + ND Peak 2.4 with sh E272 111.2 19 32.2 12% + 1.05 WR 3 V282 117.4 15 39.7 15% + 0.62 NWR 3.1 Y296 142.7 14 39.9 15% + ND Single 3.5 Peak D270 122.6 12 43.8 17% + ND Single 3.9 Peak N286 89.8 20 31.1 12% + 0.71 NWR 4 E333 96.1 8 52.6 20% + 0.55 NWR 5 S324 88.5 5 59.7 23% + 0.3 NWR 5.8 H268 140.2 4 67.6 26% + 0.46 NWR 7.9 T289 128.2 11 44 17% + 0.26 NWR 8.2 F275 97.3 13 42.4 16% + 0.24 NWR 9 I332 98.1 6 54.9 21% + 0.88 NWR 10 T335 123.7 2 82.4 32% + 0.33 NWR 11.2 P329 61.4 16 36.9 14% + 1.05 NWR 3.6 P271 78.2 18 32.3 12% + 1.05 WR 4.4 A330 63.6 10 44.7 17% + 1.51 NWR 4.5 Y300 124.0 27 17.4  7% + 0.13 NWR 4.8 S267 22.1 29 12.4  5% + 1.57 NWR 5.1 A327 38.0 22 26.3 10% + 0.69 NWR 5.2 G281 31.0 32 11.5  4% + 0.27 NWR 5.2 V305 42.8 35 10.2  4% + 0.06 NWR 5.5 N297 34.0 37 7.8  3% + 0.09 NWR 6.5 Q295 35.6 28 14.2  5% + ND Single 6.8 Peak R301 41.8 39 6  2% + ND Single 7.8 Peak E294 27.9 30 12  5% + 0.65 NWR 8.3 E283 32.1 31 11.7  4% + 0.24 NWR 8.7 D280 18.0 36 10.1  4% + ND Single 8.8 NWR K326 35.0 25 24.1  9% + 0.54 NWR 9 K317 38.0 33 11.4  4% + 0.22 NWR 9.5 K274 36.3 26 19.3  7% + 0.45 WR 10.2 T299 20.4 40 4.5  2% + ND NWR 10.3 V303 93.6 21 29 11% + 0.01 NWR 11.1 K320 13.5 38 7.1  3% + 0.93 NWR 11.9 Y278 181.6 1 94 36% + ND WR 22.5

TABLE 3 IMAC ¹⁴C CFPS IC₅₀ cell gel expression ¹⁴C Suppression HIC cell killing Variant μg/mL rank order μg/mL Efficiency Binding DAR Profile (ng/ml) R355 86 7 36 21% + 1.14 WR 13.8 T359 84 4 39 23% + 1.13 WR 7.5 N389 63 6 37 22% + 1.54 WR 9.7 S337 56 5 39 23% + ND Unresolved 6.9 A339 94 2 45 26% + ND Unresolved 7.7 L398 49 11 24 14% + ND Single Peak 9.4 Q438 56 36 ND ND + 1.46 WR 10 N421 27 15 18 11% + 1.35 WR 10.5 S375 68 8 27 16% + ND Single Peak 12.3 R344 52 14 20 12% + 1.13 WR 16.7 G420 19 18 15  9% + 1.34 WR 9.4 K340 20 20 12  7% + 1.49 Low Signal 12.9 Q386 9 19 13  8% + 1.49 NWR 16.9 K392 11 34 2  1% + 1 NWR 18 S383 39 13 23 13% + 0.72 WR 21.3 M358 25 16 16 10% + 0.93 21.6 E356 44 10 25 14% + 0.66 WR 27 K360 55 12 23 14% + 0.42 WR 31.2 Y436 22 35 ND ND + 0.6 Low Signal 32.6 N384 127 1 69 40% + 0.25 WR 34.9 Q342 6.5 30 4.5  3% + 1.59 Low Signal 35.1

Example 4 Assessment of the Thermal Stability of Exemplary Antibody-Drug Conjugates

This example describes experiments designed to measure the thermal stability (T_(m)) of aglycosylated trastuzumab and trastuzumab variants. The thermal shift assay was carried out by mixing the protein to be assayed (Sutroceptin and variants) with an environmentally sensitive dye (SYPRO Orange, Life Technologies Cat #S-6650) in a buffered solution (PBS), and monitoring the fluorescence of the mixture in real time as it undergoes controlled thermal denaturation. The final concentration of the protein in the assay mixture was between 100-250 g/mL, and the dye was 1:1000 diluted from the original stock (Stock dye is 5000× in DMSO). After dispensing 5 μL aliquots of the protein-dye mixture in a 384-well microplate (Bio-Rad Cat #MSP-3852), the plate was sealed with an optically clear sealing film (Bio-Rad Cat #MSB-1001), and placed in a 384-well plate real-time thermocycler (Bio-Rad CFX384 Real Time System). The protein-dye mixture was heated from 25° C. to 95° C., at increments of 0.1° C. per cycle (˜1.5° C. per minute), allowing 3 seconds of equilibration at each temperature before taking a fluorescence measurement. At the end of the experiment, the melting temperature (T_(m)) was determined using the Bio-Rad CFX manager software. For protein samples with complex thermal transition profiles, the melting temperature (T_(m)) is calculated from the negative first-order derivative plot of fluorescence intensity (Y-axis) against temperature (X-axis), or by fitting the data to the Boltzmann sigmoidal model. The difference in melting temperature of IgG variants compared to the wild-type protein is a measure of the thermal shift for the protein being assayed.

The results of this assay for certain variants are shown in Table 4. In general, deflections in T_(m) significantly below unsubstituted trastuzumab, particularly in T_(m)1, indicate an undesirable loss of stability and/or a propensity to aggregate. As such, trastuzumab variants that exhibit a T_(m)1 and/or T_(m)2 within about 5° C. of unsubstituted trastuzumab are preferred. More preferred are those variants that exhibit a T_(m)1 and/or T_(m)2 within about 3° C. of unsubstituted trastuzumab. Still more preferred are those variants that exhibit a T_(m)1 and/or T_(m)2 within about 2° C. of unsubstituted trastuzumab. Still more preferred are those variants that exhibit a T_(m)1 and/or T_(m)2 within about 1° C. of unsubstituted trastuzumab. The T_(m)1 and T_(m)2 can be measured in the unconjugated or conjugated forms.

TABLE 4 Antibody or Antibody-Drug Trastuzumab Conjugate Variant T_(m)1 (° C.) T_(m)2 (° C.) Antibody Aglycosylated 63.5 +/− 0.6 76.5 +/− 0.1 trastuzumab Antibody T359 64.0 +/− 0.4 76.8 +/− 0.0 Antibody E293 59.7 +/− 0.7 76.1 +/− 0.1 Antibody K334 49.5 +/− 0.8 76.5 +/− 0.1 Antibody R355 63.6 +/− 0.4 76.8 +/− 0.1 Antibody N389 62.8 +/− 0.1 76.8 +/− 0.0 Antibody-Drug T359 63.9 +/− 0.6 76.7 +/− 0.1 conjugate Antibody-Drug E293 59.2 +/− 0.0 76.0 +/− 0.3 conjugate Antibody-Drug K334 59.0 +/− 0.2 76.3 +/− 0.2 conjugate Antibody-Drug R355 63.7 +/− 0.4 76.6 +/− 0.1 conjugate Antibody-Drug N389 61.9 +/− 0.5 76.4 +/− 0.0 conjugate

Example 5 Preparation and Conjugation of Exemplary Fc Conjugates

This example describes preparation of exemplary Fc conjugates that are conjugated to a fluorescent dye. As shown in the example, certain tested residues that conjugated well to the dye also conjugated well to a cytotoxic agent in the context of Fc present in a full-length aglycosylated antibody. As such, it is believed that residues that are good sites for conjugation in a full length antibody are also good sites for conjugation to Fc.

To make the Fc conjugates, site directed mutagenesis was performed using a pYD plasmid containing coding region of Fc portion of aglycosylated trastuzumab with C-terminus his as DNA template and synthetic oligonucleotides containing mutations of interest in both sense and antisense directions. Oligonucleotides of each mutation were added to DNA template and PHUSION® polymerase (Thermo, Cat# F531 s) to a final volume of 20 μL. The final concentration of each component was 0.16 μM of each oligonucleotide, 0.5 ng/μL template DNA, 0.02 U/μL PHUSION® polymerase in HF buffer (Thermo) containing 1.5 mM MgCl₂ and 200 μM dNTP. Mixture was incubated at 98° C. 5 m, 18 PCR cycles (98° C. 30 s, 55° C. 1 m, 72° C. 4 m), 10 m at 72° C. and stored at 4° C. DpnI (NEB) was added to the mixture to final concentration of 0.6 U/μL and incubated for 37° C. 1 h. 5 μL of each mixture was transformed into 50 μL of Chemically Competent E. coli cells according to manufactures procedure (Invitrogen, MultiShot™ 96-Well Plate TOP10). Transformed cells were recovered in 200 μL SOC(Invitrogen) 37° C. 1 h and plated onto Luria-Bertani (LB) agar supplemented with 50 μg/mL Kanamycin (Teknova). After 37° C. 24 h, colonies were picked using Qpix2 (Genetix) into 200 μL LB with 7.5% glycerol and 50 μg/mL Kanamycin, and grown at 37° C. for 24 h, 20 μL of culture was used for rolling circle amplification and sequenced by primer extension using T7 (SEQ ID NO: 2, 5′ TAATACGACTCACTATAGG 3′) and T7 term (SEQ ID NO: 3, 5′ GCTAGTTATTGCTCAGCG 3′) primers (Sequetech). Sequence was analyzed by Sequencher (Gene Codes), and clones containing mutations were arrayed into 96 well plates.

Expression of the Fc proteins was performed using cell free protein synthesis system as described in Zawada et al., 2011, Biotechnol. Bioeng. 108(7)1570-1578 with the modifications described below. Cell-free extracts were treated with 50 μM iodoacetamide for 30 min at RT (20° C.) and added to a premix containing all other components except for DNA. Cell free reactions were initiated by addition of plasmid DNA of variants of interest and incubated at 30° C. for 5 h 850 rpm in 96 well plates. Reaction was incubated further at 4° C. for 11 h. The final concentration of each components was 30% cell extract, 20 μM pAzF-tRNA, 2 mM GSSG, 100 g/mL E. coli DsbC, 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 35 mM sodium pyruvate, 1.2 mM AMP, 0.86 mM each of GMP, UMP, and CMP, 2 mM amino acids (except 0.5 mM for Tyrosine), 4 mM sodium oxalate, 1 mM putrescine, 1.5 mM spermidine, 15 mM potassium phosphate, 100 nM T7 RNAP, 10 μg/mL DNA of each Fc Variant.

Samples were then purified by IMAC Phytip and by Protein A Phytip. For samples prepared by IMAC Phytip, each Fc variant was produced in 30 μL scale in a 96 well plate. Each CFPS reaction was diluted with 30 μL PBS (Gibco, pH7.4). The mixture was centrifuged 5000×g 4° C. for 15 m. Supernatant was captured with IMAC Phytip containing 5 μL resin (PhyNexus) by pipetting up and down 4 times slowly with a flow rate of 250 l/min. Resin was washed with 200 μL IMAC binding buffer (50 mM Tris pH 8.0, 300 mM NaCl, 10 mM imidazole) by pipetting up and down twice at a flow rate of 500 μL/min. This process was repeated once with an additional 200 μL IMAC binding buffer. Bound protein was eluted with 125 μL IMAC elution buffer (50 mM Tris pH 8.0, 300 mM NaCl, 500 mM Imidazole) by pipetting up and down 4 times at a flow rate of 250 μL/min.

For samples purified by Protein A Phytip, each Fc variant was produced in 30 μL scale in a 96 well plate. Each CFPS reaction was diluted with 30 μL PBS (Gibco, pH7.4). The mixture was centrifuged 5000×g 4° C. for 15 m. Supernatant was captured with Protein A Phytip containing 5 μL resin (PhyNexus) by pipetting up and down 4 times with a flow rate of 250 μl/min. Resin was washed with 200 μL PBS by pipetting up and down twice at a flow rate of 500 μL/min. Resin was then washed with 200 μL 150 mM NaCl. Bound proteins was eluted with 125 μL Glycine at pH 3.0 (100 mM) by pipetting up and down 4 times at a flow rate of 250 μL/min. The eluted protein was immediately neutralized with 50 μL of 1M Tris 8.0. The neutralized elution buffer contains 71 mM Glycine and 285 mM Tris with pH of approximately 7.8.

The conjugation to an exemplary fluorescent dye was performed as follows. DIBO-TAMRA (Invitrogen C-10410) (structure shown as FIG. 5) was dissolved in anhydrous DMSO to a final concentration of 5 mM. The compound was diluted 10× into with PBST (PBS with 0.2% Tween 20) to 500 μM and then added to Fc-his variants in IMAC elution buffer to the final drug concentration of 50 μM. Mixture was incubated at 30° C. 250 rpm for 16 h in a Thermomixer (Thermo). Sodium Azide was added to samples to final concentration of 100 mM.

To determine amounts of FC expression, Affinity Phytip purified samples were analyzed for protein A binding using ForteBio Protein A tips (18-5010). Each sample was diluted 2 to 10 fold in PBS based kinetic buffer (ForteBio 18-5032). Samples were incubated with ForteBio Protein A tip for 300 s and dissociated for 600 s in kinetic buffer. On rate was measured and compared to a standard curve generated using purified Fc-his (0.4 to 25 ug/mL) to determine concentration. Expression levels were normalized to expression of the S375pAzF variant.

To determine the amount of conjugation, IMAC or Protein A Phytip purified samples were mixed with equal volume of 2× Laemmli sample buffer and fractionated by 4-12% gel (BioRad). Fluorescence intensity of TAMRA conjugated protein was visualized by GelDoc EZ (BioRad, UV excitation filter (280-400 nM)). Relative fluorescence intensity of each gel band was quantitated and normalized to that of conjugated product of Fc variant S375pAzF produced on the same plate using corresponding plasmid DNA (SD2009).

Of tested Fc proteins, substitutions at positions corresponding to S239 and Y296 were particularly well-expressed and conjugated. Accordingly, substitutions at these positions are particularly preferred for Fc conjugates.

Example 6 Assessment of Expression of HC-F404 in an RF-1 Attenuated Extract

To initially assess the effects of using an RF-1 attenuated E. coli strain on incorporation of a non-natural amino acid, 12 variants which poorly expressed in the RF-1 positive strain were expressed and scaled up in the RF-1 attenuated strain. One such variant, HC-F404, exhibited exceptionally desirable properties, as discussed below.

The cell free extract used for this cell free protein synthesis reaction was prepared from an OmpT sensitive RF-1 attenuated E. coli strain which has also been engineered to produce an orthogonal CUA-encoding tRNA for insertion of a non-natural amino acid at an Amber Stop Codon. After addition of DNA template that encodes for HC-F404 and WT LC, the cell free reaction was incubated at 30° C. for 12 h on a shaker at 650 rpm at a 1 ml scale×6 (for a total of 9 ml) in a Flower plate (m2p-labs # MTP-48-B). The protein was then purified over Protein A and Capto Adhere resin using a Protein Maker (Emerald Bio).

Conjugation with drug (DBCO-MMAF) and preparation of Antibody Drug Conjugates (ADC's) for further analysis was done as described previously.

Thermofluor Analysis of the Variants was Carried Out as Described Previously

HC variant F404 generated both in during TAG scan as well as in the intermediate scale showed good cell-killing capability (TAG Scan I: IC50=0.018 nM; Intermediate Scale up: IC50=0.023 nM), but DAR estimation on the basis of its HIC profile proved problematic, due to low resolution of the peaks, possibly due to inaccessibility of the drug to the analytical column during HIC analysis. However, MS analysis showed that it had a DAR of 1.74. Also, thermofluor analysis showed that the ADC version of HC-F404 had an improved (higher thermal stability) T_(m)1 (increase of 2.2° C.) compared to the antibody only.

TABLE 5 Properties of HC-F404 variant Final Purified ADC concentration MS SKBR3 Cell Killing Chain Variant (ug/mL) DAR Profile IC50 (ng/mL) HC F404 541 1.74 WR* 3.5 *This DAR result is based on LC-MS analysis. The conjugated variant produced a Not-Well-Resolved (NWR) peak on the HIC assay, and DAR was determined using LC-MS

TABLE 6 Thermofluor results for HC-F404 variant Antibody-Drug Trastuzumab Antibody Only Conjugate Chain Variant T_(m)1 (° C.) T_(m)2 (° C.) T_(m)1 (° C.) T_(m)2 (° C.) HC F404 61.2 +/− 76.6 +/− 63.5 +/− 76.5 +/− 0.2 0.1 0.4 0.1

Example 7 Characterization of Exemplary Antibody-Drug Conjugates: Release Factor Analysis TAG Scan I Methods and List of Selected Variants

The cell free extract used for cell free protein synthesis reactions were prepared from an OmpT sensitive RF-1 attenuated E. coli strain which has also been engineered to express an orthogonal CUA-encoding tRNA for insertion of a non-natural amino acid at an Amber Stop Codon. After addition of DNA template, cell free reactions were incubated at 30° C. for 12 h on a shaker at 650 rpm in Flower plates (m2p-labs # MTP-48-B). For synthesis of light chain variants (variants which had nnAA incorporation on the light chain), a DNA ratio of 4 ug/mL of light chain DNA to 8 μg/mL of heavy chain DNA was used. Each trastuzumab variant was produced in 1 mL scale in 48-well flower plates in singlicate. A total of 6 plates were used to express the variants.

Supernatant was captured with IMAC Phytip containing 40 μL resin by pipetting up and down 10 times slowly at a flow rate of 4.2 uL/min. Bound protein was eluted with 125 μL IMAC elution buffer (50 mM Tris pH 8.0, 300 mM NaCl, 500 mM Imidazole). Following purification, IMAC purified variants were quantified on a Caliper GXII system by comparing with by mass standards of HERCEPTIN® run on the same Protein Express LabChip (Caliper LifeSciences #760499). Samples were prepared for analysis as specified in the Protein Express Reagent Kit (Caliper Life Sciences #760328) with the exception that the samples (mixed in sample buffer) were heated at 65° C. for 10 minutes prior to analysis on the Caliper system.

Conjugation with drug (DBCO-MMAF) and preparation of Antibody Drug Conjugates (ADC's) for further analysis was done as described previously.

Positive results of these characterization experiments are presented in Table 7. In the Table, the following properties of the variants are shown: final concentration of antibody-drug conjugate made, ability to bind HER2-expressing cells, drug-antibody ratio (DAR, expressed as the number of cytotoxic agents per antibody), a comment regarding the HIC profile, and the observed IC₅₀ from the cell-killing assay described previously. In the comments regarding the HIC assay, SP corresponds to a profile that resolved well as a single peak, NWR corresponds to a poorly resolved profile and WR corresponds to a well-resolved HIC profile. Several of the variants that exhibited a single peak profile also exhibited potent killing, which indicates that the variant was indeed conjugated with drug, but failed to resolve on the HIC column.

TABLE 7 Selected Variants for Tag Scan I Final SKBR3 Purified cell ADC HIC killing IC₅₀ Variants Chain (ug/mL) DAR Profile (ng/mL) V282 HC 52 1.0 NWR 1.6 T289 HC 46 0.3 WR 2.1 Y296 HC 32 1.5 WR 7.2 A330 HC 80 ND SP 2.4 T335 HC 43 0.7 NWR 2.3 N361 HC 42 1.0 WR 5.9 S400 HC 51 0.5 WR 7.1 F404 HC 20 ND NWR 4.3 V422 HC 100 1.3 WR 4.0 S440 HC 74 1.2 WR 6.1 T260 HC 37 0.1 NWR 4.4 S267 HC 94 1.7 WR 7.2 H268 HC 45 0.4 NWR 7.5 E272 HC 50 1.2 WR 3.0 K274 HC 64 0.8 WR 2.8 R292 HC 101 ND SP 9.0 E293 HC 133 1.7 WR 9.0 N297 HC 45 1.7 WR 4.7 S298 HC 60 1.5 WR 5.4 V303 HC 75 1.5 WR 12.7 V305 HC 66 1.5 WR 7.0 I332 HC 47 ND NWR 2.6 E333 HC 47 ND NWR 2.6 K334 HC 83 1.8 WR 6.9 K340 HC 76 1.5 WR 4.4 G341 HC 29 1.5 WR 3.8 Q342 HC 60 1.3 WR 4.8 P343 HC 51 0.8 WR 6.6 R355 HC 54 1.4 WR 4.6 Q362 HC 89 1.1 WR 6.3 Q386 HC 124 ND NWR 4.0 K392 HC 40 1.6 WR 3.4 S424 HC 24 1.3 WR 4.9 Q438 HC 33 1.5 WR 5.2 S442 HC 57 1.4 WR 4.3 L443 HC 39 1.4 WR 3.0

Preferred variants include those with a non-natural amino acid replacing these positions of the heavy chain (HC): V282, T289, Y296, A330, T335, N361, S400, F404, V422, S440, T260, S267, H268, E272, K274, R292, E293, N297, S298, V303, V305, 1332, E333, K334, K340, G341, Q342, P343, R355, Q362, Q386, K392, F404, S424, Q438, S442 and L443.

Variants that have a DAR of at least about 0.7 are also preferred. Preferred variants include those with a non-natural amino acid replacing these positions of the heavy chain (HC): V282, Y296, T335, N361, V422, S440, S267, E272, K274, E293, N297, S298, V303, V305, K334, K340, G341, Q342, P343, R355, Q362, K392, F404, S424, Q438, S442 and L443.

Variants that have a DAR of at least about 1.0 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: V282, Y296, N361, V422, S440, S267, E272, E293, N297, S298, V303, V305, K334, K340, G341, Q342, R355, Q362, K392, F404, S424, Q438, S442 and L443.

Variants that have a DAR of at least about 1.2 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: V282, Y296, V422, S440, S267, E272, E293, N297, S298, V303, V305, K334, K340, G341, Q342, R355, K392, F404, S424, Q438, S442 and L443.

Variants that have a DAR of at least about 1.5 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: V282, Y296, S267, E293, N297, S298, V303, V305, K334, K340, G341, K392, F404, and Q438.

Example 8 Characterization of Exemplary Antibody-Drug Conjugates: Additional Release Factor Analysis TAG Scan II Methods and List of Selected Variants

The cell free extract used for cell free protein synthesis reactions were prepared from an OmpT sensitive RF-1 attenuated E. coli strain which has also been engineered to produce an orthogonal CUA-encoding tRNA for insertion of a non-natural amino acid (p-azido-phenylalanine) at an Amber Stop Codon. After addition of DNA template, cell free reactions were incubated at 30° C. for 12 h on a shaker at 650 rpm in Flower plates (m2p-labs # MTP-48-B). For synthesis of light chain variants (variants which had nnAA incorporation on the light chain), a DNA ratio of 4 μg/mL of light chain DNA to 8 μg/mL of heavy chain DNA was used. Each trastuzumab variant was produced in 1 mL scale in 48-well flower plates in singlicate. A total of 6 plates were used to express the variants.

Supernatant was captured with IMAC Phytip containing 40 μL resin by pipetting up and down 10 times slowly at a flow rate of 4.2 μL/min. Bound protein was eluted with 125 μL IMAC elution buffer (50 mM Tris pH 8.0, 300 mM NaCl, 500 mM Imidazole). Following purification, IMAC purified variants were quantified on a Caliper GXII system by comparing with by mass standards of HERCEPTIN® run on the same Protein Express LabChip (Caliper LifeSciences #760499). Samples were prepared for analysis as specified in the Protein Express Reagent Kit (Caliper Life Sciences #760328) with the exception that the samples (mixed in sample buffer) were heated at 65° C. for 10 minutes prior to analysis on the Caliper system.

Conjugation with drug (DBCO-MMAF) and preparation of Antibody Drug Conjugates (ADCs) for further analysis was done as described previously.

Positive results of these characterization experiments are presented in Table 8. In the Table, the following properties of the variants are shown: final concentration of antibody-drug-conjugate made, the efficiency of expression of full length product compared to wild-type (suppression efficiency), ability to bind HER2-expressing cells, drug-antibody ratio (DAR, expressed as the number of cytotoxic agents per antibody), a comment regarding the HIC profile, and the observed IC₅₀ from the cell-killing assay described previously. In the comments regarding the HIC assay, SP corresponds to a profile that resolved well as a single peak, NWR corresponds to a poorly resolved profile and WR corresponds to a well-resolved HIC profile. Several of the variants that exhibited a single peak profile also exhibited potent killing, which indicates that the variant was indeed conjugated with drug, but failed to resolve on the HIC column.

In total, trastuzumab variants having a non-natural amino acid at 269 sites that were screened for the activities as described. Table 8 presents the 71 variants that had desirable DAR, expression profiles, suppression efficiencies, and/or IC₅₀ values. None of these properties could be predicted prior to testing, and many times the values for each individual property varied unpredictably within a sample. For example, one would generally expect that variants with high DAR would exhibit lower IC₅₀ values than those with low DAR. Nonetheless, the 151 variant, for example, exhibited a relatively low DAR of 0.6 yet exhibited an IC₅₀ of 1 ng/ml.

TABLE 8 Tag Scan II Variants MMAF SKBR3, Suppression, (Post MQ) HIC IC₅₀, % of wt by Variants Chain (ug/mL) DAR Profile ng/mL ¹⁴C D221 HC 178 1.4 WR 9.6 75 K222 HC 194 1.5 WR 8.4 87 T225 HC 156 1.4 WR 8.3 81 P227 HC 96 1.5 NWR 6.2 68 P230 HC 157 1.5 NWR 6.0 122 A231 HC 147 1.6 NWR 5.5 97 P232 HC 107 1.5 NWR 5.5 86 G236 HC 74 1.6 WR 4.5 70

Preferred variants include those with a non-natural amino acid replacing these positions of the heavy chain (HC): D221, K222, T225, P227, P230, A231, P232, and G236.

Variants that have a DAR of at least about 0.7 are also preferred. Preferred variants include those with a non-natural amino acid replacing these positions of the heavy chain (HC): D221, K222, T225, P227, P230, A231, P232, and G236.

Variants that have a DAR of at least about 1.0 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: D221, K222, T225, P227, P230, A231, P232, and G236.

Variants that have a DAR of at least about 1.2 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: D221, K222, T225, P227, P230, A231, P232, and G236.

Variants that have a DAR of at least about 1.5 are also preferred. Still more preferred were variants including those with a non-natural amino acid replacing these positions of the heavy chain: K222, P227, P230, A231, P232, and G236.

Example 9 Characterization of Exemplary Antibody-Drug Conjugates: Scale Up Analysis

Scale Up of Select Variants from TAG II Scan

Based on DAR and cell-killing data, a subset of trastuzumab variants with desirable characteristics were picked for small scale cell free expression in order to generate material for further characterization. The heavy chain variants selected for small scale production were: D221 and K222.

The cell free reaction mix in which the variants were synthesized comprised of a 80%:20% blend of cell free extracts made from an OmpT sensitive RF-1 attenuated E. coli strain, and an OmpT sensitive RF-1 attenuated E. coli strain that was engineered to express an orthogonal CUA encoding tRNA, respectively. All variants were scaled up to 9 ml in flower plates (1.5 mL X 6 replicates) and purified using Protein Maker (Emerald Bio).

Conjugation with drug (DBCO-MMAF) and preparation of Antibody Drug Conjugates (ADC's) for further analysis was done as described previously.

Positive results of these characterization experiments are presented in Table 9. In the Table, the following properties of the variants are shown: final concentration of antibody-drug-conjugate made, the efficiency of expression of full length product compared to wild-type (suppression efficiency), ability to bind HER2-expressing cells, drug-antibody ratio (DAR, expressed as the number of cytotoxic agents per antibody), a comment regarding the HIC profile, and the observed IC50 from the cell-killing assay described previously. In the comments regarding the HIC assay, SP corresponds to a profile that resolved well as a single peak, NWR corresponds to a poorly resolved profile and WR corresponds to a well-resolved HIC profile.

TABLE 9 A Subset of Preferred Fc Variants Final Purified SKBR3 ADC Cell concentration HIC Killing IC₅₀ Chain Variant (ug/mL) DAR Profile (ng/mL) HC D221 284 1.16 WR 3.5 HC K222 308 1.21 WR 3.0

Example 10 Characterization of Exemplary Antibody-Drug Conjugates: Thermofluor Analysis

Thermofluor analysis of the variants was carried out as described previously. Thermofluor results are in Table 10.

TABLE 10 Thermofluor data for small Scale production variants Antibody-Drug Trastuzumab Antibody Only Conjugate Chain Variant T_(m)1 (° C.) T_(m)2 (° C.) T_(m)1 (° C.) T_(m)2 (° C.) Aglycosylated 61.4 +/− 76.2 +/− N/A N/A Trastuzumab 0.6 0.1 HC D221 59.1 +/− 76.7 +/− 59 +/− 76.7 +/− 0   0.1 0.1 0.1 HC K222 59.1 +/− 76.7 +/− 59 +/− 76.5 +/− 0.1 0.1 0.1 0.1

Variants subject to thermofluor analysis included those with a non-natural amino acid replacing these positions of the heavy chain: D221 and K222.

Example 11 Characterization of Exemplary Antibody-Drug Conjugates: Kinetics Analysis Conjugation Kinetics of Selected Variants

Rates of conjugation were determined for five variants that showed wide range of drug to antibody conjugate ratio. Reactions were initiated by mixture of variants with drug (DBCO-MMAF) in PBS (pH7.4) at 20° C. in duplicates for 15 min, 30 min, 1 hr, 2 hr, 4 hr, 8 hr and 16 hr. Final concentrations of antibody in mixture range from 0.2 to 2 uM. Final drug concentration for all reaction is 100 uM. At the end of incubation period, NaAzide was added to the mixture to a final concentration of 10 mM. Final mixture was purified by Zeba and MustangQ plates as previously described. Concentration of IgG was determined by Caliper using HERCEPTIN® as mass standard. Fraction of fully conjugated variants was determined by HIC as previously described.

TABLE 11 Half-lives and DAR of variants HC-S112 HC-T110 HC-T77 HC-Y79 HC-F126 T½ 6.2h 10.5h ~40h ~270h ND DAR 1.7 1.6 1.1 0.4 0

FIGS. 5A and 5B provides the HIC traces of two variants (HC T110 and HC S112). The traces showed three peaks (P0, P1 and P2) corresponding to peaks with unconjugated IgG, singly conjugated IgG and fully conjugate IgG, respectively.

FIGS. 6A through 6E show that conjugation of para-azido phenylalanine (pAzF) variants is site specific. Percentages of total of single and fully conjugated antibodies were separated by HIC.

Example 12 Characterization of Exemplary Antibody-Drug Conjugates: Suppression Comparison Analysis Comparison of Suppression by Para-Azido Phenylalanine (pAzF) and Par-Azido Methyl Phenylalanine (pAzMeF)

Eleven variants with a range of suppression efficiencies were expressed in a cell-free protein synthesis reaction as follows as described in Zawada et al., 2011, Biotechnol. Bioeng. 108(7): 1570-1578 with the modifications described below. Unsubstituted trastuzumab was also made as a control. Cell-free extracts containing tRNA and OmpT sensitive RF1(80/20 blend of strains 16/23) were treated with 50 μM iodoacetamide for 30 min at RT (20° C.) and added to a premix containing all other components except for GSSG, non-natural amino acids, tRNA synthetases, T7 RNAP, DsbC, PDI, and template DNA. All remaining reagents except DNA were then added to the mixture. Cell free reactions were initiated by addition of plasmid DNA of selected variants and incubated at 30° C. for 12 h on a shaker at 450 rpm in 96-well plates. The reaction was incubated further at 4° C. for 5 h. The final concentration in the protein synthesis reaction was 30% cell extract, 1 mM para-azido phenylalanine (pAzF) (RSP Amino Acids) with 0.37 mg/mL M. jannaschii pAzF-specific amino-acyl tRNA synthetase (FRS), or 1 mM para-azido methyl phenylalanine (pAzMeF) with 0.37 mg/mL p-cyanophenylalanine specific aminoacyl-tRNA synthetase (Young et al, 2011, Biochem. 50: 1894-1900), 2 mM GSSG, 0.29 mg/mL PDI (Mclab), 100 μg/mL E. coli DsbC, 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 35 mM sodium pyruvate, 1.2 mM AMP, 0.86 mM each of GMP, UMP, and CMP, 2 mM amino acids (except 0.5 mM for Tyrosine and Phenylalanine), 4 mM sodium oxalate, 1 mM putrescine, 1.5 mM spermidine, 15 mM potassium phosphate, 100 nM T7 RNAP, 2 μg/mL trastuzumab light chain DNA, 8 μg/mL trastuzumab-(His)₆ heavy chain DNA. Each trastuzamab variant was produced in 100 uL scale in 96-well plates in duplicates with ¹⁴C for each variant. It should be noted that all trastuzumab variants thus produced were aglycosylated.

To monitor protein synthesis reactions were spiked with 3% (v/v) 1-[U-¹⁴C]-leucine (300 mCi/mmole; GE Life Sciences, Piscataway, N.J.). The suppression of amber codon at different sites of the heavy chain and light chain was determined by [¹⁴C]-autoradiograhy of reducing SDS-PAGE gels. Full length trastuzumab heavy chain and suppressed tastuzumab heavy chain variants run at 50 kD on SDS-PAGE. Full length trastuzumab light chain and suppressed tastuzumab light chain variants run at 30 kD on SDS-PAGE. Non suppressed (truncated) trastuzumab variants run at a lower molecular weight. Amber suppression in the heavy or light chain is determined by:

${suppression} = \frac{\begin{matrix} {{band}\mspace{14mu} {intensity}\mspace{14mu} {of}\mspace{14mu} {suppressed}\mspace{14mu} {heavy}\mspace{14mu} {or}} \\ {{light}\mspace{14mu} {chain}\mspace{14mu} {TAG}\mspace{14mu} {variant}} \end{matrix}}{{band}\mspace{14mu} {intensity}\mspace{14mu} {of}\mspace{14mu} {wild}\mspace{14mu} {type}\mspace{14mu} {heavy}\mspace{14mu} {or}\mspace{14mu} {light}\mspace{14mu} {chain}}$

Band intensity was determined by ImageQuant (Amersham Biosciences Corp.; Piscataway, N.J.). In FIG. 6A, the suppression efficiency of pAzF and pAzMeF variants are compared. Suppression efficiency is calculated as variant HC band intensity over wild type HC band intensity or variant LC band intensity over wild type LC band intensity. In FIG. 6B, the soluble IgG yield of pAzF and pAzMeF variants are compared. The amount of IgG is computed according to the formula: soluble counts/full counts*IgG band intensity/total lane intensity*2*74008.02 Da/47 Leucines.

Example 13 Transfer of Sites of Non-Natural Amino Acid Incorporation to Another Antibody

To test whether sites for incorporation of non-natural amino acids can be transferred between two distinct IgG sequences with predictable DARs, a second antibody containing a non-natural amino acid at representative sites was assessed. For this experiment, brentuximab was chosen as the second antibody (See SEQ ID NO: 3 and SEQ ID NO: 4). Several sites were chosen from the trastuzumab mutagenesis study exhibiting varying DARs. In the heavy chain, sites corresponded to K121, Y180, K133, S157, F126, and P127 (EU Numbering). In the light chain, sites corresponded to R142, N152, Q147, E161, K149, and Q155 (EU Numbering). The TAG stop codon was inserted into the brentuximab sequence using standard quick change based site directed mutagenesis, and the identity of each variant was confirmed by DNA sequencing. Mini Prep DNA was prepared using a Qiagen Kit, for use as a template to drive the cell free protein synthesis reaction.

The variants were synthesized in a reaction mix that contained an OmpT sensitive RF-1 protein and an in vivo expressed orthogonal CUA encoding tRNA. All variants were scaled up to 9 ml in flower plates (1.5 mL X 6 replicates) and purified using a Protein Maker (Emerald Bio).

Conjugation with drug (DBCO-MMAF) and preparation of Antibody Drug Conjugates (ADC's) for further analysis was done as described previously. DARs were identified by the HIC profiling method previously described.

The results from the experiment suggest that sites can generally be predictably transferred between two distinct IgGs.

TABLE 12A A Subset of Preferred HC variants DAR Variants trastuzumab brentuximab HC-K121 ^(TAG) 1.6 1.4 HC-Y180 ^(TAG) 1.6 0.5 HC-K133 ^(TAG) 0.7 0.8 HC-S157 ^(TAG) 0.9 1.1 HC-F126 ^(TAG) — — HC-P127 ^(TAG) — —

TABLE 12B A Subset of Preferred LC variants DAR Variants trastuzumab brentuximab LC-R142 ^(TAG) 1.2 1.3 LC-N152 ^(TAG) 1.2 1.3 LC-Q147 ^(TAG) 0.8 1 LC-E161 ^(TAG) 0.8 NA LC-K149 ^(TAG) 0.1 0.1 LC-Q155 ^(TAG) 0.1 0.1

To further characterize brentuximab conjugates, variants containing amber mutations at heavy chain positions K121 or F404 or light chain position N152 were selected for scale up and additional characterization. As a control, trastuzumab containing an amber mutation at F404 was also expressed TAG codon was inserted by overlapping PCR mutagenesis at the nucleotides corresponding to positions K121 and F404 on heavy chain and N152 on light chain and separately cloned into expression vector pYD317.

The cell free reaction mix in which the brentuximab and trastuzumab variants were synthesized comprised a blend of cell free extracts made from an OmpT sensitive RF-1 attenuated E. coli strain, and an OmpT sensitive RF-1 attenuated E. coli strain which was engineered to produce an orthogonal CUA-encoding tRNA for insertion of a non-natural amino acid at an Amber Stop Codon. The variants were expressed in a cell-free protein synthesis reaction as described in Zawada et al., 2011, Biotechnol. Bioeng. 108(7)1570-1578 with the modifications described below. Cell-free extracts were treated with 50 μM iodoacetamide for 30 min at RT (20° C.) and added to a premix containing all other components except for DNA encoding the variants of interest. The final concentration in the protein synthesis reaction was 30% cell extract, 1 mM para-azido methyl phenylalanine (pAzMeF) (RSP Amino Acids), 0.37 mg/mL M. jannaschii pAzMeF-specific amino-acyl tRNA synthetase (FRS), 2 mM GSSG, 0.29 mg/mL PDI (Mclab), 30 μg/mL E. coli DsbC, 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 35 mM sodium pyruvate, 1.2 mM AMP, 0.86 mM each of GMP, UMP, and CMP, 2 mM amino acids (except 0.5 mM for Tyrosine and Phenylalanine), 4 mM sodium oxalate, 1 mM putrescine, 1.5 mM spermidine, 15 mM potassium phosphate, 100 nM T7 RNAP. Plasmid ratios of HC to LC were tested to find the optimal condition: 3:1, 2:1, 1:1 and 1:2. Total plasmid concentration kept constant at 10 ug/mL. After addition of DNA template, cell free reactions were incubated at 30° C. for 12 h on petri dishes and followed by 14C assay analysis. Maximum yield was achieved at 1:1 ratio. 10 mL cell-free reactions were carried under this condition.

To purify the variants, 10 ml of crude cell-free for each variant was first diluted 1:0.5 with equilibration buffer (50 mM sodium phosphate, pH 7) and spun at 11,000×g for 30 minutes. The supernatant was then passed through a 0.45 micron syringe filter prior to being loaded with a 2 minute residence time onto a pre-equilibrated 1 mL MabSelect Sure HiTrap (GE Lifesciences) to capture the IgG variants. The column was then washed with 7.5 CV (column volume) of wash buffer 1 (100 mM sodium phosphate and 800 mM Arginine, pH 7) and followed by 7.5 CV of wash buffer 2 (50 mM sodium phosphate and 0.5% (v/v) Triton X-100, pH 7.3). After washing with 7.5 CV of equilibration buffer, each variant was eluted with 4 CV elution buffer (100 mM sodium citrate and 300 mM Arginine, pH 3). The elution pool was adjusted to pH 7 by addition of 30% (v/v) of 1M Tris, pH9.

The collected elution pool was buffer exchanged into PBS via overnight dialysis in 10 kD Slide-A-Lyzer (Pierce) units. The dialyzed material was then concentrated using an Amicon Ultra-15 (Millipore) centrifugal filter unit to a concentration of 5-10 mg/mL.

The purified variants were conjugated as follows. DBCO-MMAF 2 (ACME Bioscience; Palo Alto, Calif.) was dissolved in DMSO to a final concentration of 5 mM. The compound was diluted with PBS to a concentration of 1 mM and then added to purified trastuzumab variants in IMAC elution buffer to achieve a final drug concentration of 100 μM. Mixture was incubated at RT (20° C.) for 17 hours. Reaction was stopped by adding Sodium Azide to final concentration of 100 mM and buffer exchanged using Zeba plates (Thermo Scientific; Waltham, Mass.) equilibrated in 1×PBS.

DAR for the aglycosylated variants was determined by LC-MS as follows. ADCs by LCMS

Samples were run on a Waters Aquity UPLC system attached a Xevo QTOF. Proteins were separated on an Agilent PLRP-S column (2.3×50 mm, 5 μm, 4000 Å) at 80° C. Mobile phases: A: 0.1% formic acid water; b: 20:80 isopropanol:acetonitrile, 0.1% formic acid. Samples were desalted on column for 0.4 minutes at 10% B followed by a step gradient from 30% B to 40% B over 7 minutes, 40% B to 60% B over 3 minutes. Data was acquired over the whole LC elution using a cone voltage of 35V. Spectra were analyzed using MassLynx software. DAR values were calculated as a weighted average and are shown in Table 13

TABLE 13 DAR values for ADCETRIS ® ADCs Variant Drug DAR Aglycosylated Brentuximab IgG HC- DBCO- 1.87 F404 MMAF 2 Aglycosylated Brentuximab IgG HC- DBCO- 1.79 K121 MMAF 2 Aglycosylated Brentuximab IgG LC- DBCO- 1.74 N152 MMAF 2 Aglycosylated Trastuzumab IgG DBCO- 1.97 HC-F404 MMAF 2

Next, cell binding and cell killing activities of the variants were measured as follows. Karpas 299 and L540 cell lines were obtained from German Collection of Microorganisms and Cell Cultures (DSMZ) and SKBR3 and Raji cell lines were obtained from American Type Culture Collection (ATCC). Cells were grown in RPMI 1650 medium (Cellgro-Mediatech; Manassas, Va.) containing 20% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× penicillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). Adherent SKBR3 cells were grown in DMEM/Nutrient F-12 Ham (50:50) high glucose medium (Cellgro-Mediatech; Manassas, Va.) supplemented with 10% heat-inactivated fetal bovine serum, 2 mM glutamax and 1× penicillin/streptomycin. All cells were grown and maintained at 37° C. in a 5% CO2 incubator. SKBR3 were passaged by washing with calcium/magnesium-free Phosphate Balanced Saline (PBS) and harvested with HyQTase (Hyclone; Thermo Scientific; Waltham, Mass.).

Cell killing activities of Trastuzumab and Brentuximab (ADCETRIS®) variants were measured using CellTiter-Glo® cell proliferation assay. Briefly, 10⁴ suspension cells or 3×10³ SKBR3 cells were plated in 40 μl per well in a 96-well half-area white TC-treated plate and incubated at 37° C. for 2 hours prior to adding antibodies. Antibodies were formulated in the respective cell culture media and filter-sterilized with 0.45 μm celluloase aceteate Costar Spin-X® centrifuge tube filters (Corning; Tewksbury, Mass.). 40 μl of unconjugated antibody, AB4285-conjugated antibody or AB4285 free drug were added to cells and cultured for either 3 days or 5 days for suspension cells and SKBR3 cells, respectively. Cell viability was measured by adding 80 μl of Cell Titer-Glo® reagent (Promega Corp.; Madison, Wis.) to each well and processed following product instructions. Luminescence was measured with the ENVISION® plate reader (Perkin-Elmer; Waltham, Mass.). Relative luminescence readings were converted to % viability using untreated cells as controls and plotted versus antibody concentration. IC50 was calculated from the non-linear regression equation, log (inhibitor) versus response—variable slope (4 parameters) using the statistical software, Prism (GraphPad Software; San Diego, Calif.). Results are shown in FIGS. 8 and 9 and Table 14.

Cell binding affinities of trastuzumab and brentuximab variants were measured using a FACS-based assay. Briefly, 2×10⁵ Karpas 299 or L540 cells in 25 μl of binding buffer, which consists of PBS containing 0.5% bovine serum albumin (BSA) (Invitrogen; Carlsbad, Calif.) and 0.09% sodium azide (Sigma; St. Louis, Mo.) were plated in a 96-well U-bottom polypropylene plate (Greiner Bio-One; Monroe, N.C.). Antibodies were formulated in binding buffer and 2-fold serial dilutions were prepared in a separate 96-well plate. Equal volume of diluted antibodies were added to cells and incubated at 4° C. for 1 hour. Cells were washed twice with 200 μl of binding buffer to remove unbound antibodies. For secondary antibody detection, 5 μg/ml Alexa-488 conjugated goat anti-human IgG (Invitrogen; Carlsbad, Calif.) was prepared in binding buffer and 50 μl was added to cells and incubated at 4° C. for 1 hour. Cells were washed twice, resuspended in 200 μl of binding buffer and analyzed using a BD FACScan® flow cytometer (Cytek; Fremont, Calif.) equipped with a 96-well automated microsampler (Cytek; Fremont, Calif.). Data was acquired and analyzed using FlowJo (Tree Star; Ashland, Oreg.). Mean fluorescence intensity readings were expressed as % binding by normalizing to averaged maximum MFI values and data was plotted versus antibody concentration. Binding affinities was calculated from the non-linear regression saturation binding equation, one site—total binding using Prism software. Results are shown in FIG. 10 and table 15.

TABLE 15 Cell Binding of Brentuximab Variants Cell Line Karpas L540 ADC Site Kd (nM) Kd (nM) Aglycosylated Brentuximab HC F404 0.46 0.81 DBCO-MMAF 2 Aglycosylated Brentuximab HC K121 0.52 1.3 DBCO-MMAF 2 Aglycosylated Brentuximab LC N152 0.50 1.0 DBCO-MMAF 2

Example 14: Antibody-Drug Conjugates in Alternative Scaffolds: scFv Formats

To demonstrate the feasibility of using scFv as alternative scaffold for ADC, DNA encoding trastuzumab scFv (VL_VH) with an amber codon was cloned into expression vector pYD317. The TAG codon was inserted by overlapping PCR mutagenesis at the nucleotides corresponding to the amino acid serine at the position (−1).

To express the scFv, cell-free extracts were thawed to room temperature and incubated with 50 uM iodoacetamide for 30 min. Cell-free reactions were run at 30 C for up to 10 h containing 30% (v/v) iodoacetamide-treated extract with 8 mM magnesium glutamate, 10 mM ammonium glutamate, 130 mM potassium glutamate, 35 mM sodium pyruvate, 1.2 mM AMP, 0.86 mM each of GMP, UMP, and CMP, 2 mM amino acids for all 18 amino acids except tyrosine and phenylalanine which were added at 0.5 mM, 4 mM sodium oxalate, 1 mM putrescine, 1.5 mM spermidine, 15 mM potassium phosphate, 100 nM T7 RNAP, 1.3 uM E. coli DsbC, 2 mM oxidized (GSSG) glutathione, 10 ug/mL scFv plasmid 15 uM in vivo produced m.j. tRNA, 5 uM m.j RNA synthetase and 1 mM pAzido Phenylanine (pN3F). Wild type scFv was expressed as control.

scFv(−1)_(p)N₃F and wild type scFv were purified by Protein L followed by SEC. 5 uM scFv (−1) was incubated with 50 uM of the DBCO-MMAF reagent shown in FIG. 1 for 16 hours at room temperature. The excess free drug was removed by zeba desalting column.

Hydrophobic Interaction Chromatography was then performed to quantitate the samples and to determine the drug-antibody ratios as follows. Samples and standards were diluted 1:1 in 3M Ammonium Sulfate (EMD Chemical), 50 mM Sodium Phosphate pH 7.0 (Mallinckrodt) prepared in MilliQ water. A Dionex HPLC system was equipped with a Tosoh Bioscience LLC TSK-gel Butyl-NPR® (4.6 mm×3.5 cm) column with a column compartment temperature of 30° C. The mobile phase A was 1.5M Ammonium Sulfate, 50 mM Sodium Phosphate, pH 7.0. The mobile phase B was 50 mM Sodium Phosphate, pH 7.0 in 80:20 water:isopropyl Alcohol (Honeywell). The mobile phase was delivered at a flow rate of 1.0 mL/minute. The separation was performed with a linear gradient of 15% mobile phase B to 100% mobile phase B in 10 minutes. The UV data was acquired at 214 nm. The peak areas were quantitated using Chromeleon software (Thermo) to calculate Drug Antibody Ratio (DAR).

To assess binding of the conjugated alternative scaffold variants, a binding assay to cells expressing HER2 was performed as follows. The binding of the purified conjugated variants to HER2 on SKBR3 cells, which overexpress the HER2/c-erb-2 gene product, with over 1.5 million receptor copies per cell (ATCC # HTB-30, Manassas, Va.) was compared to clinical grade Herceptin®, unglycosylated trastuzumab produced by cell-free protein synthesis, or human serum IgG1 as a negative control (Sigma-Aldrich; St. Louis, Mo.). SKBR3 cells were cultured in DMEM:Ham's F-12 (50:50), high glucose (Cellgro-Mediatech; Manassas, Va.) supplemented with 10% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× Pencillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). Adherent cells were washed twice with calcium and magnesium-free Hanks Balanced Salt Solution (HBSS), harvested with HYQ® TASE™ (Hyclone; Thermo Scientific; Waltham, Mass.). A total of 200,000 cells per sample in total volume of 10 μL were incubated with serial dilutions of either conjugated alternative scaffold variants, clinical grade HERCEPTIN®, or aglycosylated trastuzumab made in 10 μL FACS buffer (DPBS buffer supplemented with 1% bovine serum albumin). Cells plus antibody or ADC were incubated for 60 minutes on ice. Unstained cells, human IgG1 (Isotype control) and Secondary antibody (goat anti-human IgG) were used as controls. To detect HERCEPTIN® or aglycosylated trastuzumab binding, cells were washed twice with ice-cold FACS buffer and incubated with 5 μg/ml Alexa 647 labeled goat anti-human IgG secondary antibody (Invitrogen; Carlsbad, Calif.) on ice for 1 hour. Alternative scaffold variant cell binding was detected by incubating cells on ice for 1 h with either 5 μg/ml Alexa 647 labeled Protein L (Pierce) or goat anti-human-Fc labeled with Alexa 647 (Invitrogen; Carlsbad, Calif.). All samples were washed using FACS buffer and analyzed using a BD FACS Calibur system (BD Biosciences; San Jose, Calif.).

Mean fluorescence intensities were fitted using non-linear regression analysis with one site specific binding equation using GraphPad Prism (GraphPad v 5.00, Software; San Diego, Calif.). Data was expressed as Relative MFI vs. concentration of antibody or antibody variant in nM.

Next, the effects of the conjugated alternative scaffold variants on cell killing were measured with a cell proliferation assay as follows. SKBR3 cells were obtained from ATCC and maintained in DMEM:Ham's F-12 (50:50), high glucose (Cellgro-Mediatech; Manassas, Va.) supplemented with 10% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× Pencillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). Adherent cells were washed twice with calcium and magnesium-free Hanks Balanced Salt Solution (HBSS), harvested with HYQ® TASE™ (Hyclone; Thermo Scientific; Waltham, Mass.). A total of 10³ cells were seeded in a volume of 40 μl in a 96-well half area flat bottom white Polystyrene plate. The cells were allowed to adhere overnight at 37° C. in a CO₂ incubator. ADC variants were formulated at 2× concentration in DMEM/F12 medium and filtered through MultiScreen_(HTS) 96-Well Filter Plates (Millipore; Billerica, Mass.). Filter sterilized conjugated alternative scaffold variants, HERCEPTIN®, or aglycoslyated trastuzumab were added into treatment wells and plates were cultured at 37° C. in a CO2 incubator for 120 hrs. For cell viability measurement, 80 μl of Cell Titer-Glo® reagent (Promega Corp.; Madison, Wis.) was added into each well, and plates processed as per product instructions. Relative luminescence was measured on an ENVISION® plate reader (Perkin-Elmer; Waltham, Mass.). Relative luminescence readings were converted to % viability using untreated cells as controls. Data was fitted with non-linear regression analysis, using log(inhibitor) vs. response-Variable slope, 4 parameter fit equation using GraphPad Prism (GraphPad v 5.00, Software; San Diego, Calif.). Data was expressed as relative cell viability, ATP content % vs. dose of ADC in nM.

Results from the HIC analysis, the cell binding, and cell killing experiments are presented in Table 16.

TABLE 16 scFv Cell Binding, Cell Killing, and DAR SKBR3 SKBR3 Cell Cell Killing DAR by Binding Kd, HIC Scaffold IC50, nM nM Assay Unglycosylated Trastuzumab  5 NA NA scFv(−1) WT 15 NA NA scFv(−1) conjugate 14 0.48 0.77

Example 15: Antibody-Drug Conjugates in Alternative Scaffolds: scFv-Fc Formats

To demonstrate the feasibility of using scFv Fc as alternative scaffold for ADC, DNA encoding trastuzumab scFv Fc (VL_VH_Fc) with amber was cloned into expression vector pYD317. TAG codon was inserted by overlapping PCR mutagenesis at the nucleotides corresponding to the amino acid serine at the position (−1), Fc R355, Fc N389, and Fc F404 (EU index numbering).

CFPS reaction conditions were same as described in Example 14, except that 5 uM yeast PDI was added. Wild type scFv Fc was expressed as control.

To conjugate aglycosylated scFv-Fc, proteins were first purified by Protein A followed by SEC. 5 uM pN3F containing scFv-Fc was incubated with 50 uM DBCO-MMAF for 16 hours at room temperature. The excess free drug was removed by zeba desalting column.

Additionally, pAzido Methyl Phenylanine was incorporated to scFv Fc at the sites of R355, N389, and F404, respectively. The reaction condition was same as described in Example 14, except that 1 uM m.j. pCyano FRS and 1 mM pAzido Methyl Phe were used to replace the pair of m.j. FRS and pAzdio Phe. The protein A purified protein was then conjugated with DBCO-MMAF reagent 2 (DBCO-MMAF 2). The structure of DBCO-MMAF 2 is shown in FIG. 11.

Next, LC-MS was performed to quantitate the samples and to determine the drug-antibody ratios as follows. Samples were analyzed by liquid chromatography (CHIP-Cube nanoLC, Agilent) coupled to a qTOF mass spectrometer (Agilent 6520). Proteins were separated on a reverse phase HPLC-Chip (PLRP-S, 4000 Å, 5μ; Enrichment column: 25 mm; Separation column: 150 mm×75 m) with 0.1% formic acid in water (solvent A) and 0.1% formic acid in acetonitrile+isopropyl alcohol (80:20 v/v, solvent B). Data was processed using MassHunter Qualitative Software (Agilent). Data is shown in Table 17.

TABLE 17 DAR and Cell Binding and Killing of scFv-Fc variants SKBR3 SKBR3 DAR Cell Cell by Binding Killing LCMS Scaffold Kd, nM IC50, nM DAR Herceptin ® 12 NA NA Aglycosylated Trastuzumab 8.1 NA NA Her scFv-Fc 6.0 NA NA scFv-Fc (−1) pN₃F DBCO- 6.8 0.011 1.86 MMAF scFv-Fc (R355) pN₃F DBCO- 6.9 0.015 1.77 MMAF scFv-Fc (N389) pN₃F DBCO- 5.4 0.010 1.53 MMAF scFv-Fc (N389) pN₃CH₂F 3.0 0.052 1.97 DBCO-MMAF 2 scFv-Fc (R355) pN₃CH₂F 5.0 0.055 1.97 DBCO-MMAF 2 scFv-Fc (F404) vH-vL 0.05 1.99 pN3CH2 F DBCO-MMAF 2 scFv-Fc (F404) vL-vLH 0.068 1.97 pN3CH2 F DBCO-MMAF 2

In vivo stability of drug linker ADC conjugates was measured by dosing 2 mg/kg of respective scFv-Fc DBCO-MMAF 2 conjugates into Beige nude Xid mice. Plasma was collected by terminal bleeds at 30 mins, d3, d7, d14 and d21 from n=2 animals for each time point. Total circulating ScFv-FC ADCs were captured by Biotin-(Fab)₂ Goat Anti-Human IgG, Fcγ fragment specific. Complex was pulled down with streptavidin Mag Sepharose. Captured complex was washed to remove nonspecific binding and eluted with 1% formic acid. Neutralized eluate was analyzed by intact LCMS. The results indicate that scFv-Fc (N389) pN₃CH₂F DBCO-MMAF 2 and scFv-Fc (R355) pN₃CH₂F DBCO-MMAF 2 are stable up to 28 days.

Next, pharmacokinetics properties of ScFv-Fc ADCs were analyzed in beige nude xid mice. Animals were dosed intravenously at a dose level of approximately of 2.0 mg/kg of scFv-Fc R355 and scFv-Fc N389. The plasma concentrations, sampled out to 28 days, were determined by immunoassay and the pharmacokinetic parameters calculated using a non-compartmental approach with WinNonlin ‘v’ 5.2 (Pharsight, CA). Results are shown in FIG. 12.

To assess in vivo efficacy of the scFc-Fc ADCs, KPL-4 human breast tumor cells were inoculated into the mammary fat pads of SCID beige mice (Charles River Laboratories). A total of 3 million cells per mouse, suspended in 50% phenol red-free Matrigel (Becton Dickinson Bioscience) mixed with culture medium were injected. Once tumor size was reached all animals were randomly assigned into treatment groups, such that the mean tumor volume for each group was 100-150 mm³.

Trastuzumab (30 mg/kg) was given i.p. (single injection on treatment day 0), followed by (15 mg/kg) per week for 3 weeks. Aglycosylated Trastuzumab 2 nnAA variant scFv-Fc N389 DBCO-MMAF 2 15 mg/kg (872 ug MMAF/m2, DAR 1.901), Aglycosylated Trastuzumab 2nnAA variant HC-S136 (15 mg/kg) (unconjugated) Vehicle (PBS) and free drug (0.54 mg/kg) were given via i.v (single injection on treatment day 0). The results of this experiment are presented in FIG. 13 a.

In another experiment, scFv-Fc F404 vL-vH DBCO-MMAF 2 (15 mg/kg), scFv-Fc F404 vH-vL DBCO-MMAF 2 (15 mg/kg), Trastuzumab-CF HC F404 DBCO-MMAF 2 (15 mg/kg), Trastuzumab-CF (15 mg/kg), and vehicle were administered by single i.v. injection on day 0. The results of this experiment are presented in FIG. 13 b.

For both experiments, all treatment groups consisted of 10 animals per group, and tumor size was monitored twice weekly using caliper measurement. Mice were housed in standard rodent microisolator cages. Environmental controls for the animal rooms were set to maintain a temperature of 70° F., a relative humidity of 40% to 60%, and an approximate 14-h light/10-h dark cycle.

The results of this experiment demonstrate that the ADCs conjugated at positions N389 and F404 were effective in an animal to durably regress tumor size. Of particular note, the F404 scFv-Fc conjugates achieved regression below baseline, demonstrating particular efficacy for this scaffold in a solid tumor model.

Example 16: Exemplary ADCs Specific for CD74 and Her2 and with Additional Exemplary Linker-Warhead Combinations

This example provides exemplary antibody-drug conjugates prepared with an antibody specific for CD74 and containing different exemplary linker-warhead combinations as described herein. This example further provides exemplary antibody-drug conjugates prepared from trastuzumab conjugated with certain of the linker-warhead combinations.

The protein sequence of the heavy and light chains of the anti-CD74 antibody used in this example are provided as SEQ ID NO:5 and 6, respectively. The heavy chain sequence includes a 6-His C-terminal tag to assist with puridication. Amber codons at the positions encoding HC K212, HC S136, HC F241, HC F404, LCS7, or LC N152 were introduced using the methods described in Example 13. Anti-CD74 antibodies containing p-methylazido-Phe were expressed and purified using the methods described in Example 13. Trastuzumab variants containing p-methylazido-Phe at positions HC S136 or HC F404 were also expressed and purified as described in Example 13.

Next, the anti-CD74 or trastuzumab variants were conjugated with DBCO-MMAF 2 (FIG. 11A), DBCO-DM4 (FIG. 11B), DBCO-DM4 2 (FIG. 11C), or DBCO-MMAE (FIG. 11D). The method used to perform the conjugation reactions between the antibody variants and the linker-warhead combinations was as described in Example 13.

The drug-antibody ratios (DARs) of the ADCs was then determined by LC-MS according to the method described in Example 13. It should be noted that the LC-MS method for the anti-CD74 samples had not been optimized for this antibody; the results of the analysis contained peaks overlapping with unconjugated species, artificially lowering the DAR of these samples. The DAR of the ADC variants prepared in this example therefore showed good agreement with the ADCs containing a nnAA at corresponding sites discussed in Examples 1-15. Results of the DAR analysis are presented in Table 18, below.

Next, the ADC variants were used in cell killing experiments to determine their IC₅₀ values. Trastuzumab variants were analyzed as described in Example 13. The effects of the conjugated anti-CD74 variants on cell killing were measured by a cell proliferation assay. SU-DHL-6 and NCI-H929 cells were obtained from ATCC and maintained in RPMI-1640 medium (Cellgro-Mediatech; Manassas, Va.) supplemented with 20% heat-inactivated fetal bovine serum (Hyclone; Thermo Scientific; Waltham, Mass.), 2 mM glutamax (Invitrogen; Carlsbad, Calif.) and 1× Pencillin/streptomycin (Cellgro-Mediatech; Manassas, Va.). SU-DHL-6 and NCI-H929 cells were harvested and re-suspended in culture medium at final concentration of 0.5×10⁶ cells/mL. A total of 20×10³ cells in a volume of 40 μl were seeded in each well of a 96-well half area flat bottom white polystyrene plate. ADC variants were formulated at 2× concentration in RPMI-1640 complete medium and filtered through MultiScreen_(HTS) 96-Well Filter Plates (Millipore; Billerica, Mass.). Filter sterilized ADCs were added into treatment wells and plates were cultured at 37° C. in a CO₂ incubator for 72 hrs. For cell viability measurement, 80 μl of Cell Titer-Glo® reagent (Promega Corp. Madison, Wis.) was added into each well, and plates were processed as per product instructions. Relative luminescence was measured on an ENVISION® plate reader (Perkin-Elmer; Waltham, Mass.). Relative luminescence readings were converted to % viability using untreated cells as controls. Data was fitted with non-linear regression analysis, using log(inhibitor) vs. response-variable slope, 4 parameter fit equation using GraphPad Prism (GraphPad v 5.00, Software; San Diego, Calif.). Data was expressed as relative cell viability, ATP content % vs. dose of ADC in nM. Results of the cell killing assays are presented in Table 18.

TABLE 18 DAR and IC₅₀ Values for Exemplary Anti-CD74 and Anti-HER2 ADCs Antibody Drug Conjugate Site DAR IC₅₀ Anti-CD74 Antibody DBCO-MMAF 2 HC K121 1.601 0.122 Anti-CD74 Antibody DBCO-MMAF 2 HC S136 1.601 0.100 Anti-CD74 Antibody DBCO-MMAF 2 HC F241 1.58 0.078 Anti-CD74 Antibody DBCO-MMAF 2 HC F404 1.631 0.137 Anti-CD74 Antibody DBCO-MMAF 2 LC S7 1.543 0.120 Anti-CD74 Antibody DBCO-MMAF 2 LC N152 1.457 0.130 Anti-CD74 Antibody DBCO-DM4 HC K121 1.572 0.191 Anti-CD74 Antibody DBCO-DM4 HC S136 1.61 0.271 Anti-CD74 Antibody DBCO-DM4 HC F241 1.581 0.187 Anti-CD74 Antibody DBCO-DM4 HC F404 1.531 0.253 Anti-CD74 Antibody DBCO-DM4 LC S7 1.509 0.293 Anti-CD74 Antibody DBCO-DM4 LC N152 1.474 0.281 Anti-CD74 Antibody DBCO-DM4 2 HC K121 1.578 0.215 Anti-CD74 Antibody DBCO-DM4 2 HC S136 1.589 0.215 Anti-CD74 Antibody DBCO-DM4 2 HC F241 1.612 0.196 Anti-CD74 Antibody DBCO-DM4 2 HC F404 1.63 0.154 Anti-CD74 Antibody DBCO-DM4 2 LC S7 1.561 0.291 Anti-CD74 Antibody DBCO-DM4 2 LC N152 1.495 0.235 Anti-CD74 Antibody DBCO-MMAE HC K121 1.811 0.166 Anti-CD74 Antibody DBCO-MMAE HC S136 1.752 0.242 Anti-CD74 Antibody DBCO-MMAE HC F241 1.81 0.238 Anti-CD74 Antibody DBCO-MMAE HC F404 1.572 0.235 Anti-CD74 Antibody DBCO-MMAE LC S7 1.581 0.224 Anti-CD74 Antibody DBCO-MMAE LC N152 1.616 0.251 Trastuzumab DBCO-DM4 HC F404 0.036 1.84 Trastuzumab DBCO-DM4 HC S136 0.038 1.82 Trastuzumab DBCO-DM4 2 HC F404 0.043 1.94 Trastuzumab DBCO-DM4 2 HC S136 0.038 1.82

Example 17 further demonstrates that the site of incorporation of a nnAA into an antibody can transfer reasonably predictably between antibodies. Further, the identity of the linker and warhead does not appear to significantly affect the conjugation efficiency or DAR of the resulting ADC, demonstrating reasonable predictability that a given warhead-linker combination can be conjugated to the an antibody containing a non-natural amino acid at one of the preferred sites.

All publications and patent, applications cited in this specification are herein incorporated by reference as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference. While the claimed subject matter has been described in terms of various embodiments, the skilled artisan will appreciate that various modifications, substitutions, omissions, and changes may be made without departing from the spirit thereof. Accordingly, it is intended that the scope of the subject matter limited solely by the scope of the following claims, including equivalents thereof.

Sequence Listing  <210> SEQ ID NO 1  <211> LENGTH: 449  <212> TYPE: PRT  <213> ORGANISM: Artificial sequence  <220> FEATURE:  <223> OTHER INFORMATION: sequence is synthesized  <400> SEQUENCE: 1  Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly    1               5                  10                  15  Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys                   20                  25                  30  Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu                   35                  40                  45  Glu Trp Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr                   50                  55                  60  Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser                   65                  70                  75  Lys Asn Thr Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp                   80                  85                  90  Thr Ala Val Tyr Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr                   95                 100                 105  Ala Met Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser                  110                 115                 120  Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser                  125                 130                 135  Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys                  140                 145                 150  Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala                  155                 160                 165  Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser                  170                 175                 180  Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser                  185                 190                 195  Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser                  200                 205                 210  Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys                  215                 220                 225  Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly                  230                 235                 240  Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met                  245                 250                 255  Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser                  260                 265                 270  His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val                  275                 280                 285  Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn                  290                 295                 300  Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp                  305                 310                 315  Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala                  320                 325                 330  Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln                  335                 340                 345  Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu                  350                 355                 360  Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe                  365                 370                 375  Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro                  380                 385                 390  Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly                  395                 400                 405  Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp                  410                 415                 420  Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu                  425                 430                 435  His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly                  440                 445  SEQ ID NO 2 T7 primer 5′TAATACGACTCACTATAGG 3′ SEQ ID NO 3 T7 primer 5′GCTAGTTATTGCTCAGCG3′ SEQ ID NO 4 >Sequence of brentuximab heavy chain: HC-6His MQIQLQQSGPEVVKPGASVKISCKASGYTFTDYYITWVKQKPGQGLEWIGWIYPGSGNTKYNEKFKGKAT  LTVDTSSSTAFMQLSSLTSEDTAVYFCANYGNYWFAYWGQGTQVTVSAASTKGPSVFPLAPSSKSTSGGT  AALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNT  KVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV  DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVY  TLPPSREEMTKNQVSLICLVKGFYPSDIAVEWESNGQPENNYKTIPPVLDSDGSFFLYSKLIVDKSRWQQ  GNVFSCSVMHEALHNHYTQKSLSLSPGKGGSHHHHHH SEQ ID NO 5 >Sequence of brentuximab heavy chain: HC-6His MQIQLQQSGPEVVKPGASVKISCKASGYTFTDYYITWVKQKPGQGLEWIGWIYPGSGNTKYNEKFKGKAT  LTVDTSSSTAFMQLSSLTSEDTAVYFCANYGNYWFAYWGQGTQVTVSAASTKGPSVFPLAPSSKSTSGGT  AALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNT  KVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV  DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVY  TLPPSREEMTKNQVSLICLVKGFYPSDIAVEWESNGQPENNYKTIPPVLDSDGSFFLYSKLIVDKSRWQQ  GNVFSCSVMHEALHNHYTQKSLSLSPGKGGSHHHHHH  SEQ ID NO 6 >Sequence of anti-CD74 heavy chain: HC M Q V Q L V E S G G G V V Q P G R S L R L S C A A S G F T F S S Y G M  H W V R Q A P D K G L E W V A V I W Y D G S N K Y Y A D S V K G R F T  I S R D N S K N T L Y L Q M N S L R A E D T A V Y Y C A R G G T L V R  G A M Y G T D V W G Q G T T V T V S S A S T K G P S V F P L A P S S K  S T S G G T A A L G C L V K D Y F P E P V T V S W N S G A L T S G V H  T F P A V L Q S S G L Y S L S S V V T V P S S S L G T Q T Y I C N V N  H K P S N T K V D K K V E P K S C D K T H T C P P C P A P E L L G G P  S V F L F P P K P K D T L M I S R T P E V T C V V V D V S H E D P E V  K F N W Y V D G V E V H N A K T K P R E E Q Y N S T Y R V V S V L T V  L H Q D W L N G K E Y K C K V S N K A L P A P I E K T I S K A K G Q P  R E P Q V Y T L P P S R E E M T K N Q V S L T C L V K G F Y P S D I A  V E W E S N G Q P E N N Y K T T P P V L D S D G S F F L Y S K L T V D  K S R W Q Q G N V F S C S V M H E A L H N H Y T Q K S L S L S P G K G  G S H H H H H H  

1. An Fc protein comprising a polypeptide chain having one or more non-natural amino acid residues at specific sites selected from the group consisting of optimally substitutable positions in the polypeptide chain, or a post-translationally modified variant thereof
 2. The Fc protein of claim 1 comprising a polypeptide chain having one or more non-natural amino acid residues at specific sites selected from the group consisting of heavy chain residues H404, H241, and H222 according to the EU index of Kabat, or a post-translationally modified variant thereof.
 3. The Fc protein of claim 1 that does not comprise a variable domain or a light chain.
 4. The Fc protein of claim 1 comprising a polypeptide chain having at least 70%, 80% or 90% homology to SEQ ID NO: 1 and having one or more non-natural amino acid residues at specific sites selected from the group consisting of sites corresponding to residues 404, 241, and 222 of the representative heavy chain polypeptide according to SEQ ID NO: 1, or a post-translationally modified variant thereof.
 5. The Fc protein of claim 1 wherein said polypeptide chain is a heavy chain of a type selected from the group consisting of a, 6, E and a.
 6. The Fc protein of claim 1 that corresponds to a class or subclass selected from the group consisting of IgA, IgA, IgA2, IgD, IgE, IgG, IgG1, IgG2, IgG3 and IgM.
 7. The Fc protein of claim 1 wherein said non-natural amino acid residue comprises a moiety selected from the group consisting of amino, carboxy, acetyl, hydrazino, hydrazido, semicarbazido, sulfanyl, azido and alkynyl.
 8. The Fc protein of claim 1 wherein each non-natural amino acid residue is according to the formula

wherein each L is independently a divalent linker; and each R is independently a functional group.
 9. The Fc protein of claim 8 wherein R is a reactive group, a therapeutic moiety or a labeling moiety.
 10. The Fc protein of claim 8 wherein each R is a reactive group selected from the group consisting of amino, carboxy, acetyl, hydrazino, hydrazido, semicarbazido, sulfanyl, azido and alkynyl.
 11. The Fc protein of claim 8 wherein each L is a divalent linker selected from the group consisting of a bond, alkylene, substituted alkylene, heteroalkylene, substituted heteroalkylene, arylene, substituted arylene, heteroarlyene and substituted heteroarylene.
 12. An Fc protein conjugate comprising the Fc protein of claim 1 linked to one or more therapeutic moieties or labeling moieties.
 13. The Fc protein conjugate of claim 12 that comprises said Fc protein linked to one or more drugs or polymers.
 14. The Fc protein conjugate of claim 12 that comprises said Fc protein linked to one or more labeling moieties.
 15. The Fc protein conjugate of claim 12 that comprises said Fc protein linked to one or more single chain binding domains (scFv).
 16. The Fc protein conjugate of any of claim 12 wherein said Fc protein is linked to said one or more therapeutic moieties or labeling moieties via one or more linkers.
 17. A composition comprising the Fc protein of claim 1, wherein said composition is substantially pure.
 18. A composition comprising the Fc protein of claim 1 wherein said Fc protein is at least 95% by mass of the total protein mass of said composition. 